Smart Data Analytics Research Group Overview

Slide Note
Embed
Share

This content provides an overview of the Smart Data Analytics Research Group at the Jülich Supercomputing Center and the University of Iceland. It covers their research areas such as classification, clustering, deep learning, and more. The group focuses on developing scalable tools and platforms for data analytics applications in scientific use cases. Collaboration with various institutions and the application of machine learning algorithms are highlighted.


Uploaded on Dec 16, 2024 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. Big Data Interest Group Smart Data Analytics Markus G tz J lich Supercomputing Center (JSC) // University of Iceland Member of the Helmholtz Association Morris Riedel J lich Supercomputing Center (JSC) // University of Iceland 03/10/2015 | RDA Fifth Plenary Meeting | San Diego, USA | Paradise Point Resort

  2. Outline Introduction Research Group, Research Area Smart Data Analytics Use Cases and Techniques Classification, Land Cover Type, piSVM Clustering, Drunken Flies , HPDBSCAN Deep-Learning, Cortex Layers, pylearn CNN Conclusion Results and RDA Context Member of the Helmholtz Association 03/10/2015 Markus G tz | Smart Data Analytics | Forschungszentrum J lich 2

  3. Introduction Research Group J lich Supercomputing Center (HPC/HTC) High Productivity Data Processing Group Parallel Data Analytics Data Mining Methods Machine Learning Algorithms Smart Data Analytics Research Area Smart Data Analytics Methods Evaluation and Development of Scalable Tools Processing Platform Requirements Application in Scientific Use Case Scientific Community Application Data Analzsis Tools Generic Data Methods Member of the Helmholtz Association 03/10/2015 Markus G tz | Smart Data Analytics | Forschungszentrum J lich 3

  4. Classification // Land Cover Type Land Cover Type Problem Collaboration with University of Iceland Determine Land Cover Type in Satellite Images Different Types - Road, Building, Vegetation, Classification Supervised Learning Technique Known Set of Groups or Classes Determine Membership of New Items Member of the Helmholtz Association 03/10/2015 Markus G tz | Smart Data Analytics | Forschungszentrum J lich 4

  5. Classification // Land Cover Type Approach Support Vector Machines (SVM) Existing Solution: piSVM (MPI) In-house Optimization of Parallel Code Member of the Helmholtz Association Inertia Standard deviation Area 03/10/2015 Markus G tz | Smart Data Analytics | Forschungszentrum J lich 5

  6. Clustering // Drunken Flies Drunken Flies Collaboration with University of Cologne Investigate Influence of Genetics on Alcohol Consumption Literally Make Flies Drunk Clustering Unsupervised Learning Technique Subdivide Database into Similar Groups Similarity Metrics Member of the Helmholtz Association 03/10/2015 Markus G tz | Smart Data Analytics | Forschungszentrum J lich 6

  7. Clustering // Drunken Flies Approach Image Processing Pipeline HPDBSCAN In-house Development (MPI+OpenMP) Member of the Helmholtz Association 03/10/2015 Markus G tz | Smart Data Analytics | Forschungszentrum J lich 7

  8. Deep Learning // Cortex Layers Cortex Layer Problem Institute for Neuro-Medicine (INM) at FZJ Segment the Seven Layers of the Cortex Images of Actual Brain Slices Each Gigabytes (60k square resolution) Deep Learning Supervised Learning Technique (Classification) More Advanced Mathemical Models Various Flavors of Neural Networks Member of the Helmholtz Association 03/10/2015 Markus G tz | Smart Data Analytics | Forschungszentrum J lich 8

  9. Deep Learning // Cortex Layers Approach Convolutional Neural Networks Existing Serial Toolkit Pylearn 2/Theano Scaling Issues Member of the Helmholtz Association 03/10/2015 Markus G tz | Smart Data Analytics | Forschungszentrum J lich 9

  10. Conclusion Results Big Data Challenge is Real! Gap between Analytics Requirements and Actual Implementations Interest for RDA Code is Open-source @ GitHub and Bitbucket Data is Open and Freely Published @ B2SHARE Choice of Dataformats Question of Future Processing Platforms Member of the Helmholtz Association 03/10/2015 Markus G tz | Smart Data Analytics | Forschungszentrum J lich 10

  11. Thanks you for the attention Fifth Plenary Meeting 08 12 March 2015 San Diego, USA | Paradise Point Resort Member of the Helmholtz Association Contact: m.goetz@fz-juelich.de Slides: Big Data IG > Wiki > 5th Plenary 03/10/2015 Markus G tz | Smart Data Analytics | Forschungszentrum J lich 11

Related


More Related Content