Introduction to IBM Watson Explorer in Business Intelligence: University of Rome La Sapienza Course

 
Course Lab
Introduction to IBM Watson Explorer
 
Ing. Vittorio Carullo
IBM Italia
v.carullo@it.ibm.com
 
University of Rome «La Sapienza»
Course of Business Intelligence - 2017
 
Our Target
 
Familiarize with a «real» software used in large
organizations
 
Accomplish small but significant use cases in BI arena
 
Introduce advanced topics in BI
 like the use of “non
structured” information
 
Lab Schedule
 
Lab sessions on Tuesday from October 17, 2017 , 4 - 6 pm
 
Presentation of the Watson Explorer tool and its basic features
 
(1 – 1.5 sessions)
 
Use of the tool for conducting standard BI use cases
 
(2 – 2.5 sessions)
 
Use of the tool for Advanced Content Analytics
 
(2 sessions)
 
Reference Materials
 
IBM Redbook on Watson Content Analytics
http://www.redbooks.ibm.com/abstracts/sg247877.html?Open
Suggested chapters: 1-6. Further chapters are more «technical»
 
IBM Knowledge Center
https://www.ibm.com/support/knowledgecenter/SS8NLW_11.0.2/co
m.ibm.discovery.es.nav.doc/explorer_analytics.htm
Use it just as a technical reference for product features
 
Today’s Contents: Advanced Content Mining
 
1.
Using Dashboards
1.
Lab 4.1 – The Big Boss Request
2.
Text Analytics basics
3.
TBD
 
Lab 3.2 – TBDLAb
4. TBD 2
 
 
1. Using Dashboards
 
The Dashboard View
 
The Dashboard view shows various predefined charts and tables in a single
view.
 
 
Role of the Dashboard
 
With this view, you can
 
Visualize at one glance various aspects of the data to quickly interpret,
analyze, and further investigate
 
D
efine and show KPIs (Key Performance Indicators) for your sets that can
directly give some evidence about your data
 
Share the data with other people for collaboration purposes (authoring
reports, etc.).
 
Dashboard features
 
Query – driven results
Unlike a static report, the results of the Dashboard charts are dynamically
updated whenever you change a filter
 
I
nteractive navigation
I
n a dashboard, graphics are navigable and tied one with another. For instance,
if an user clicks on a pie slice, the system automatically performs a query in
real time to drill-down results, and all other graphs are updated in real time
 
Dashboard components
 
Dashboard View is composed by arranging  a set of 
widgets
 in a predefined
layout
Each widget renders a representation 
of the analyzed data using a typical
graphic metaphore
 
Widget examples:
 
Charts (bar/column/pie/bubble,….),
Heat maps
Word clouds
…..
Exploring the Dashboard View
Layout
Selector
Widgets
Action
Selector
 
Creating / Customizing Dashboard
Dashboard View is composed by arranging  a set of 
widgets
 in a predefined
layout
Each widget renders a representation 
of the analyzed data using a typical
graphic metaphore
 
Widget examples:
 
Charts (bar/column/pie/bubble,….)
Heat maps
Word clouds
…..
 
 
Dashboard Actions Menu
 
New
: Create a new dashboard form scratch
Save
: Save modifications to current dashboard
Save As
: Save modifications to a new dashboard
Customize
: Open the Dashboard Editor to
change current dashboard
Delete
: Delete current dashboard
Refresh
: Refresh the state of current dashboard
 
Create a new Dashboard
 
From 
Actions 
menu, select New.
Type a name to identify the newly created dashboard
 
Layouts
 
Layouts provide two or more «containers» that define the
graphic arrangement of the widgets.
It is possible to choose an initial layout and then change it
subsequently
 
Dashboard Customizer
 
Use the dashboard customizer to
create widgets and arrange them
into containers
A container can contain one or
more widgets
Widgets have an external controller
that handles graphical aspects and
an internal controller that contain
widgets configuration
 
Container: external controller
 
Container: internal controller
 
Adding widgets to container
 
WEX provides several types
of widgets that can be
added to dashboards
Some widgets are the same
present in other Content
Miner views (Facets,
Trends, etc)
RAVE (
Rapidly Adaptive
Visualization Engine) 
charts
are very handy for creating
useful KPIs
 
RAVE Charts: Important settings
 
RAVE Template: the
graphical form of the
widget
Facet to analyze: specify
the facet that has to be
visualized
Sort by: Count first or
Correlation first
How many facet to display
and to analyze for
calculation
 
Hands On: Lab 4.1
 
- The Big Boss Request -
I want to know 
EVERYTHING
 about consumer
complaints:
What are most frequent issues
In what US state
What are most problematic companies and
how they behaved during the year
If they respond on time
 
ALL with drill-down
ALL in one screen
 
 
Lab 4.1 Hints & Tips
 
Create a dashboard to satisfy the Big Boss Request
Use a two columns layout
Put two widgets in each container
Use Correlation Rave Chart for count Issues, States and
Timely Responses
Pie charts are OK when there are few values
For more values use column/bar chart
For a lot of values use word cloud
Use Trends Rave Chart for Company performances
during time (set time scale to Month)
 
Content Miner Link
http://172.31.1.2:8393/ui/analytics
 
 
2. Text Analytics
 
Remember the «body»?
 
During last lesson, we focused on the need to define as text 
body
 the most
significant part of our data in descriptive terms
Examples
Tweets  
   
-> body = message
Customer complaints 
 
-> body = problem description
Quality assurance
 
-> body = product feedback
….
 
Body  is automatically set as 
Analyzable
 index field during Watson Explorer
collection configuration
 
 
What happens to Analyzable text?
 
Index fields set as 
analyzable
 are processed in a special way during indexing
phase by a WEX component called 
Document Processor
 
In order to provide text analysis, Document Processor uses a pipeline of
processing «blocks» called 
annotators, 
each one specialized for a particular
analytical task.
 
The final purpose is to enrich text documents with the most complete
possible set of 
analytical facets
, i.e. metadata related to syntax, semantics
and meaning of the text
 
Analytical facets may be used, together with other structured facets, to
search/filter/examine content and discover insights in an even more
powerful fashion
 
Document processing pipeline
 
Document processing pipeline / 1
 
Language Identification Annotator
: understands and takes note of language used. A wide
variety of languages are supported.
 
Linguistic Analysis Annotator 
: Based on the identified language, linguistic structures (noun,
verbs, etc. ) are identified and lemmatized, i.e. reconducted to the canonical form.
 
Dictionary Lookup Annotator
: if present, this annotator search for occurrencies of particular
terms contained in a specific lexicon. This can be useful for technical or specific jargon where
words are not commonly found in vocaboulary.
 
Document processing pipeline / 2
 
Named Entity Recognition Annotator
: This block recognizes words line names of persons,
places or organizations, specific to the language
 
Pattern Matcher Annotator 
: This annotator recognizes specific expressions based on a literal
pattern like license plates, credit card numbers, social security numbers, etc.
 
Content Classification Annotator
: if present, this annotator is able to classify text content into
one or more given categories, according to the topics treated.
 
Document processing pipeline / 3
 
Machine-Learning Annotator
: If present, this block can be trained to recognize specific entities
and relations. Training is performed by providing some examples of such entities (more details
in next lesson)
 
Rule-Based Annotator 
: This annotator can combine annotations previously discovered in the
pipeline and build more complex annotations  using sort of «proximity rules» (for example a
date followed by a signature)
 
Custom Annotator
: if present, this block contains an annotator built using coding tools.
Software code may create a wide set of annotation of various types.
Example: Police Report
 
These are 
annotation types
 built
using annotator blocks. Different
types are sown with different
colors.
Annotation results are
shown here as 
text
highlights
. Using
annotations one can
discover relevant facts
into text.
 
Linguistic analysis
 
Linguistic analysis annotators build «grammar
analysis» of text and can identify POS (parts
of speech) and phrase constituents
This analysis can be very useful as the «first
inquiry» to discover facts and clues inside
body text
Looking especially to nouns and verbs it is
posible to answer the basic question: «
what
are they talking about here??
»
 
Linguistic analysis for Consumer Complaints: Nouns
 
Linguistic analysis for Consumer Complaints: Verbs
 
 
Hands On: Lab 4.2
 
Use Linguistic Analysis in Content Miner to analyze
and discover the type of activity for various companies
Identify at least 
five
 companies and try to understand
what kind of financial activity they usually are involved
in
Do not use Product facet
Report your findings in Exercise Form 4.2
 
 
Content Miner Link
http://172.31.1.2:8393/ui/analytics
No authentication required
 
 
Lab 4.2 Hints & Tips
 
Use Noun Facet in Facets View
Make a list of words that are significant for the banking
context (loan, mortgage, credit, etc.)
Try to identify some groups  of «activity types» by
grouping similar words
Note: activity type is an insight that 
you
 are
discovering!
Use Facet Pairs View (Company vs Nouns)
What nouns are more related to certain companies and
vice versa?
Using association with nouns, make associations with
activity type
Slide Note
Embed
Share

The course at University of Rome La Sapienza introduces students to IBM Watson Explorer for Business Intelligence. It covers practical use cases, advanced BI topics, and familiarization with real software used in organizations. The schedule includes sessions on Watson Explorer features, conducting BI use cases, and advanced content analytics. Reference materials from IBM are provided for further study. Today's topics include advanced content mining, using dashboards, and text analytics basics.

  • IBM Watson Explorer
  • Business Intelligence
  • University of Rome
  • La Sapienza
  • BI course

Uploaded on Apr 02, 2024 | 0 Views


Download Presentation

Please find below an Image/Link to download the presentation.

The content on the website is provided AS IS for your information and personal use only. It may not be sold, licensed, or shared on other websites without obtaining consent from the author. Download presentation by click this link. If you encounter any issues during the download, it is possible that the publisher has removed the file from their server.

E N D

Presentation Transcript


  1. University of Rome La Sapienza Course of Business Intelligence - 2017 Course Lab Introduction to IBM Watson Explorer Ing. Vittorio Carullo IBM Italia v.carullo@it.ibm.com

  2. Our Target Familiarize with a real software used in large organizations Accomplish small but significant use cases in BI arena Introduce advanced topics in BI like the use of non structured information

  3. Lab Schedule Lab sessions on Tuesday from October 17, 2017 , 4 - 6 pm Presentation of the Watson Explorer tool and its basic features (1 1.5 sessions) Use of the tool for conducting standard BI use cases (2 2.5 sessions) Use of the tool for Advanced Content Analytics (2 sessions)

  4. Reference Materials IBM Redbook on Watson Content Analytics http://www.redbooks.ibm.com/abstracts/sg247877.html?Open Suggested chapters: 1-6. Further chapters are more technical IBM Knowledge Center https://www.ibm.com/support/knowledgecenter/SS8NLW_11.0.2/co m.ibm.discovery.es.nav.doc/explorer_analytics.htm Use it just as a technical reference for product features

  5. Todays Contents: Advanced Content Mining 1. Using Dashboards 1. Lab 4.1 The Big Boss Request 2. Text Analytics basics 3. TBD Lab 3.2 TBDLAb 4. TBD 2

  6. 1. Using Dashboards

  7. The Dashboard View The Dashboard view shows various predefined charts and tables in a single view.

  8. Role of the Dashboard With this view, you can Visualize at one glance various aspects of the data to quickly interpret, analyze, and further investigate Define and show KPIs (Key Performance Indicators) for your sets that can directly give some evidence about your data Share the data with other people for collaboration purposes (authoring reports, etc.).

  9. Dashboard features Query driven results Unlike a static report, the results of the Dashboard charts are dynamically updated whenever you change a filter Interactive navigation In a dashboard, graphics are navigable and tied one with another. For instance, if an user clicks on a pie slice, the system automatically performs a query in real time to drill-down results, and all other graphs are updated in real time

  10. Dashboard components Dashboard View is composed by arranging a set of widgets in a predefined layout Each widget renders a representation of the analyzed data using a typical graphic metaphore Widget examples: Charts (bar/column/pie/bubble, .), Heat maps Word clouds ..

  11. Exploring the Dashboard View Action Selector Layout Selector Widgets

  12. Creating / Customizing Dashboard Dashboard View is composed by arranging a set of widgets in a predefined layout Each widget renders a representation of the analyzed data using a typical graphic metaphore Widget examples: Charts (bar/column/pie/bubble, .) Heat maps Word clouds ..

  13. Dashboard Actions Menu New: Create a new dashboard form scratch Save: Save modifications to current dashboard Save As: Save modifications to a new dashboard Customize: Open the Dashboard Editor to change current dashboard Delete: Delete current dashboard Refresh: Refresh the state of current dashboard

  14. Create a new Dashboard From Actions menu, select New. Type a name to identify the newly created dashboard

  15. Layouts Layouts provide two or more containers that define the graphic arrangement of the widgets. It is possible to choose an initial layout and then change it subsequently

  16. Dashboard Customizer Use the dashboard customizer to create widgets and arrange them into containers A container can contain one or more widgets Widgets have an external controller that handles graphical aspects and an internal controller that contain widgets configuration

  17. Container: external controller

  18. Container: internal controller

  19. Adding widgets to container WEX provides several types of widgets that can be added to dashboards Some widgets are the same present in other Content Miner views (Facets, Trends, etc) RAVE (Rapidly Adaptive Visualization Engine) charts are very handy for creating useful KPIs

  20. RAVE Charts: Important settings RAVE Template: the graphical form of the widget Facet to analyze: specify the facet that has to be visualized Sort by: Count first or Correlation first How many facet to display and to analyze for calculation

  21. Hands On: Lab 4.1 - The Big Boss Request - I want to know EVERYTHING about consumer complaints: What are most frequent issues In what US state What are most problematic companies and how they behaved during the year If they respond on time ALL with drill-down ALL in one screen

  22. Lab 4.1 Hints & Tips Create a dashboard to satisfy the Big Boss Request Use a two columns layout Put two widgets in each container Use Correlation Rave Chart for count Issues, States and Timely Responses Pie charts are OK when there are few values For more values use column/bar chart For a lot of values use word cloud Use Trends Rave Chart for Company performances during time (set time scale to Month) Content Miner Link http://172.31.1.2:8393/ui/analytics

  23. 2. Text Analytics

  24. Remember the body? During last lesson, we focused on the need to define as text body the most significant part of our data in descriptive terms Examples Tweets -> body = message Customer complaints -> body = problem description Quality assurance -> body = product feedback . Body is automatically set as Analyzable index field during Watson Explorer collection configuration

  25. What happens to Analyzable text? Index fields set as analyzable are processed in a special way during indexing phase by a WEX component called Document Processor In order to provide text analysis, Document Processor uses a pipeline of processing blocks called annotators, each one specialized for a particular analytical task. The final purpose is to enrich text documents with the most complete possible set of analytical facets, i.e. metadata related to syntax, semantics and meaning of the text Analytical facets may be used, together with other structured facets, to search/filter/examine content and discover insights in an even more powerful fashion

  26. Document processing pipeline

  27. Document processing pipeline / 1 Language Identification Annotator: understands and takes note of language used. A wide variety of languages are supported. Linguistic Analysis Annotator : Based on the identified language, linguistic structures (noun, verbs, etc. ) are identified and lemmatized, i.e. reconducted to the canonical form. Dictionary Lookup Annotator: if present, this annotator search for occurrencies of particular terms contained in a specific lexicon. This can be useful for technical or specific jargon where words are not commonly found in vocaboulary.

  28. Document processing pipeline / 2 Named Entity Recognition Annotator: This block recognizes words line names of persons, places or organizations, specific to the language Pattern Matcher Annotator : This annotator recognizes specific expressions based on a literal pattern like license plates, credit card numbers, social security numbers, etc. Content Classification Annotator: if present, this annotator is able to classify text content into one or more given categories, according to the topics treated.

  29. Document processing pipeline / 3 Machine-Learning Annotator: If present, this block can be trained to recognize specific entities and relations. Training is performed by providing some examples of such entities (more details in next lesson) Rule-Based Annotator : This annotator can combine annotations previously discovered in the pipeline and build more complex annotations using sort of proximity rules (for example a date followed by a signature) Custom Annotator: if present, this block contains an annotator built using coding tools. Software code may create a wide set of annotation of various types.

  30. Linguistic analysis Linguistic analysis annotators build grammar analysis of text and can identify POS (parts of speech) and phrase constituents This analysis can be very useful as the first inquiry to discover facts and clues inside body text Looking especially to nouns and verbs it is posible to answer the basic question: what are they talking about here??

  31. Linguistic analysis for Consumer Complaints: Nouns

  32. Linguistic analysis for Consumer Complaints: Verbs

  33. Hands On: Lab 4.2 Use Linguistic Analysis in Content Miner to analyze and discover the type of activity for various companies Identify at least five companies and try to understand what kind of financial activity they usually are involved in Do not use Product facet Report your findings in Exercise Form 4.2 Content Miner Link http://172.31.1.2:8393/ui/analytics No authentication required

  34. Lab 4.2 Hints & Tips Use Noun Facet in Facets View Make a list of words that are significant for the banking context (loan, mortgage, credit, etc.) Try to identify some groups of activity types by grouping similar words Note: activity type is an insight that you are discovering! Use Facet Pairs View (Company vs Nouns) What nouns are more related to certain companies and vice versa? Using association with nouns, make associations with activity type

Related


More Related Content

giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#giItT1WQy@!-/#