Problem Areas Addressed by Phil Bernstein at Microsoft Research

four problem areas phil bernstein microsoft l.w
1 / 6
Embed
Share

Explore the four key problem areas tackled by Phil Bernstein at Microsoft Research, including ad hoc querying of non-integrated databases, database as a service, software/hardware co-design for databases, and handling the tidal wave of data. Learn about the challenges and innovative solutions in database technology research.

  • Database Technology
  • Data Analysis
  • Microsoft Research
  • Phil Bernstein
  • Data Management

Uploaded on | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. Four Problem Areas Phil Bernstein Microsoft Research

  2. List of Problems Ad hoc querying of non-integrated databases. Database as a service. Hardware/software co-design Dealing with the tidal wave of data

  3. Ad hoc querying of non-integrated data Enterprise data sources: data warehouses, SQL databases, key-value stores, spreadsheets, SharePoint sites, event logs, internet sites, etc. To answer a given business question, many of these sources may be relevant Top 10 product improvements that would most please my top 25 customers. Requires looking at call center logs, blogs, product marketing input, search logs on the company's support website, etc. Today, this is much too time-consuming for a data analyst to figure out We need better tools for data discovery, data profiling, deriving relationships and other semantics, dealing with dirty data, and formulating queries. We have components technologies. Why is it still so hard?

  4. Database as a Service For a large fraction of users, a database will be a service. What are the best forms for this to take? How best to support multi-tenancy? Resource governance within a VM or across VM s Multi-tenants within a database (e.g., tenant-specific query processing, schema diversity and evolution, and security) Minimize labor in keeping the service up What to measure? How to use the data? Fast root-cause analysis of problems.

  5. Software/Hardware Co-Design for Databases Trends that are driving hardware innovation for databases Limited improvements in single-threaded performance Custom hardware development by cloud providers Popularity of database appliances Areas that could benefit from custom hardware Transaction processing, encryption, query execution Hardware technology that s worth considering Transactional memory, fast datacenter networks, GPU s, FPGA s

  6. Dealing with the Tidal Wave of Data Internet of things, mobile phones, massive on-line games, telemetry from industrial systems The scenario: Process some as a virtual world of objects Process some as streams Save data for later, off-line analysis Problems programming model, scalability, fault tolerance, security, geo-distribution,

More Related Content