Visual Analytics Sandbox for Big Data Insights

visual analytics sandbox n.w
1 / 25
Embed
Share

Explore the Visual Analytics Sandbox for advanced data processing, efficient resource management, and real-time decision-making in a Big Data environment. Discover tools for data ingestion, storage, processing, and visualization within the VA Sandbox. Accessible through university networks, this platform empowers users to analyze big data effectively.

  • Analytics
  • Data Insights
  • Big Data
  • Visualization
  • Resource Management

Uploaded on | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

You are allowed to download the files provided on this website for personal or commercial use, subject to the condition that they are used lawfully. All files are the property of their respective owners.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author.

E N D

Presentation Transcript


  1. Visual Analytics Sandbox Satya Katragadda January 25, 2018

  2. Agenda Why Big Data? Goals Visual Analytics Sandbox Traditional Workflow in a Big Data Environment VA Sandbox: Software Stack VA Sandbox: Execution Examples

  3. Why Big Data? Reports, e.g., Track business processes, transactions Diagnosis, e.g., Why is user engagement dropping? Why is the system slow? Detect spam, worms, viruses, DDoS attacks Decisions, e.g., Decide what feature to add Decide what ad to show Block worms, viruses,

  4. Goals Low latency (interactive) queries on historical data: enable faster decisions E.g., identify why a site is slow and fix it Low latency queries on live data (streaming): enable decisions on real-time data E.g., detect & block worms in real-time (a worm may infect 1mil hosts in 1.3sec) Sophisticated data processing: enable better decisions E.g., anomaly detection, trend analysis

  5. Visual Analytics Sandbox

  6. Big Data Workflow Data Ingestion Data Management Data Processing Visualization Resource Management

  7. VA Sandbox: Software Stack

  8. VA Sandbox: Resource Manager

  9. VA Sandbox: Data Injestion

  10. VA Sandbox: Data Storage

  11. VA Sandbox: Processing and Visualization

  12. VA Sandbox Stephens Hall Accessible through university network

  13. VA Sandbox: Access

  14. VA Sandbox: Execution

  15. VA Sandbox: Input

  16. VA Sandbox: Spark Script

  17. VA Sandbox: Spark Output

  18. Alternative Execution Environment HUE: Hadoop User Experience An open-source Web interface that supports Apache Hadoop and its ecosystem Component Applications Editor SQL, Pig, Spark Browsers YARN, Oozie, Impala, HBase, Livy Scheduler Oozie Dashboard Solr, SQL (Impala, Hive...)

  19. HUE: File Browser

  20. HUE: Job Execution

  21. HUE: Output

  22. HUE: Editors

  23. HUE: Schedulers

  24. HUE: Dashboards

  25. Questions? Satya Katragadda RM 118, Abdalla Hall satya@Louisiana.edu

Related


More Related Content