SlideShare a Scribd company logo
Big Data Insight




Blueprint for Integrating Big Data Analytics and BI
Abe Taha, VP Engineering
abetaha@karmasphere.com




www.karmasphere.com
Big Data Insight


>  Agenda



ĂŒïƒŒâ€Ż Where does Big Data Analytics fit in the BI ecosystem
ĂŒïƒŒâ€Ż How does Big Data Analytics complement the type of analysis we do today using BI
ĂŒïƒŒâ€Ż What are clients doing with Big Data Analytics that they couldn’t do with BI
ĂŒïƒŒâ€Ż What do we need to think about to make Hadoop deployments successful




2                                                Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
Big Data Insight


>  Hadoop not standing alone
Big Data Insight


>  Parallel and Complementary Stacks
Big Data Insight


    >  The Best of Both Worlds = Big Data Analytics + Traditional BI


                           Traditional BI                     Big Data Analytics
       Purpose             Reporting on business              Optimizing the business
       Paradigm            Ask a specific question            Ask any question
       Format              Look at structured data            Look at all data
       Setup               Pre-engineered                     On-the-fly
       Data locations       Siloed                            One place
       Agility              Weeks to months                   Almost Immediate




5                                                    Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
Big Data Insight


    >  Big Data Analytics on Hadoop Use Cases



        Product
      Optimization      ‱  Insight to usage patterns, bug paths, quality outages
                        ‱  Outline new features, improve product roadmap and process
                        ‱  Enhance customer service, quality and product “stickiness”


     Unified Customer
           View         ‱  Insight to correlations across product lines and interaction channels
                        ‱  Personalize oïŹ€ers, services and customer experience
                        ‱  Reduce churn and increase customer satisfaction


       Marketing
      Performance       ‱  Insight to market program attribution and ROI
                        ‱  Increase customer targeting through micro-segmentation
                        ‱  Optimize online ads and cross channel programs




6                                                                                   © Karmasphere 2012
Big Data Insight


    >  What Hadoop Adopters Are Saying



      “The kind of new stuïŹ€
         we want to do
       can’t get done with
                BI“
           Large Hi Tech Chip Manufacturer

7                                        Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
Big Data Insight


>  How to make Hadoop successful with BI



1.  Employ All Data
2.  Use All Analytic Assets
3.  Provide Self-Service Access for All Users
4.  Build a Collaborative Environment
5.  Be Open and Extensible
6.  Populate Best-of-Breed Reporting Tools
Big Data Insight


>  Cornerstone 1: Employ All Data



ĂŒïƒŒâ€Ż Leave No Data Behind
  ‱    Raw unstructured – Web logs, machine /
       sensor data, mobile social, video, etc.

  ‱    Structured data – traditional RDMBS, EDW’s

  ‱    Streaming vs. batch oriented

  ‱    Data governance and quality
Big Data Insight


>  Cornerstone 2: Use All Analytic Assets



ĂŒïƒŒâ€Ż Employ All Analytic Assets
   ‱    Traditional models and assets

   ‱    Standard Hadoop components including
        UDFs and SerDes

   ‱    Custom algorithms

   ‱    Models created in other systems such as
        SAS/R
Big Data Insight


>  Cornerstone 3: Provide Self-Service Access for All Users



ĂŒïƒŒâ€Ż Self-Service
‱    BYOD: Bring Your Own Data
‱    Ingest custom functions and algorithms
‱    Intuitive, no special skill sets required

ĂŒïƒŒâ€Ż Empower All Users and Skill Sets
‱    Business User
     ‱    Easy-to-use ad-hoc analysis, web-based forms
     ‱    Drag and drop

‱    Data Analysts
     ‱    Common skills: SQL
     ‱    Powerful iterative analysis
     ‱    Analytical models and algorithms

‱    Customers and Partners for ecosystem
Big Data Insight


>  Cornerstone 4: Build a Collaborative Environment



ĂŒïƒŒâ€Ż Collaborative
‱  Project-based environment

‱  Leverage cross-functional skills

‱  Security and isolation

ĂŒïƒŒâ€Ż Social
‱  Share data and insights across teams
   ‱    Metadata, Queries, Results and Visualizations

‱  View colleague’s activities

‱  Usage feedback and metrics
Big Data Insight


>  Cornerstone 5: Be Open and Extensible



ĂŒïƒŒâ€Ż Open
‱  Active community, rapid innovation

‱  Vendor commitment

‱  Standards based
‱  Portable - No vendor lock-in

‱  Expose standard API’s and interfaces


ĂŒïƒŒâ€Ż Extensible
‱  Add custom functions

‱  Reuse existing analytic models
‱  Add additional data sources by defining custom parsers
Big Data Insight


>      Cornerstone 6: Populate Best-of-Breed Reporting Tools



ĂŒïƒŒâ€Ż Best-Of-Breed Reporting tools
‱  Ingest data from existing BI systems and ad hoc data including
     Spreadsheet data

‱  Automate delivery of insights

‱  Push insights to RDBMS, EDW’s and MPP

‱  Expose standards APIs for programmability
Big Data Insight


     >  How would an architecture look




15                                       Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
Big Data Insight


      >  Summary


1.  Implement Big Data Analytics and BI co-existence   Hadoop at your fingertips
2.  Leverage all your assets
3.  Use and build on open and extensible solutions     across your company

4.  Build social and collaborative in early            	
  




                                                                            Private and Confidential
Big Data Insight


>  Summary Get the Best of Both Worlds – Build a Bridge Inside Your Company


                                           Big Data Analytics on Hadoop
                                           Future, see intent
                                           Drives Optimization
   BI                                      Just getting started
   Historical
   Drives reporting
   Entrenched
   Be around for a long time
Questions?
abetaha@karmasphere.com	
  
www.karmasphere.com	
  
	
  

More Related Content

PDF
Next generation big data bi
Stanley Wang
 
PDF
Integrating BI - Data Warehouse and Big Data
Accenture Analytics
 
PPTX
Business intelligence
SIBICHAKKARAVARTHYCM
 
PPTX
From Business Intelligence to Big Data - hack/reduce Dec 2014
Adam Ferrari
 
PPT
Oracle business intelligence 11g overview by aorta
Aorta business intelligence
 
PPTX
Birst for SAP HANA
Birst
 
PPTX
Big Data, Business Intelligence and Data Analytics
Systems Limited
 
PPTX
AI and Marketing: Robot-proofing Your Job
Call Sumo
 
Next generation big data bi
Stanley Wang
 
Integrating BI - Data Warehouse and Big Data
Accenture Analytics
 
Business intelligence
SIBICHAKKARAVARTHYCM
 
From Business Intelligence to Big Data - hack/reduce Dec 2014
Adam Ferrari
 
Oracle business intelligence 11g overview by aorta
Aorta business intelligence
 
Birst for SAP HANA
Birst
 
Big Data, Business Intelligence and Data Analytics
Systems Limited
 
AI and Marketing: Robot-proofing Your Job
Call Sumo
 

What's hot (20)

PPTX
Datamensional Business Intelligence and Data Services
Datamensional
 
PPTX
Need of business intelligence
Vivek Mohan
 
PPTX
Location Intelligence - the Next Evolution of Business Applications
MISNet - Integeo SE Asia
 
PPT
The evolution of Business Intelligence
Richard Claassens CIPPE
 
PPTX
How different between Big Data, Business Intelligence and Analytics ?
Thanakrit Lersmethasakul
 
DOCX
Business Intelligence
Sukirti Garg
 
PPTX
Overview of Business Intelligence
Parthiv Dixit
 
PDF
Self-Service BI Trends
Netwoven Inc.
 
PPTX
Big Data Case study - caixa bank
Chungsik Yun
 
PPTX
Big Data and Semantic Web in Manufacturing
Nitesh Khilwani
 
PPTX
Instant Analytics with Birst and SAP HANA Cloud Platform for #sitNL
Richard Neale
 
PDF
Big Data Analytic with Hadoop: Customer Stories
Yellowfin
 
PDF
Spring 2017 Sage 300 (Accpac) Users Group
Gross, Mendelsohn & Associates
 
PPTX
New Approach to Supply Chain Analytics
demando
 
PDF
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Matt Stubbs
 
PPT
The Evolution of Business Intelligence
Call Sumo
 
PPTX
#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...
MITX
 
PPTX
Tools and techniques for predictive analytics
RohanKumarJumnani
 
PDF
Data analytics as a service
Stanley Wang
 
PDF
The Present - the History of Business Intelligence
Phocas Software
 
Datamensional Business Intelligence and Data Services
Datamensional
 
Need of business intelligence
Vivek Mohan
 
Location Intelligence - the Next Evolution of Business Applications
MISNet - Integeo SE Asia
 
The evolution of Business Intelligence
Richard Claassens CIPPE
 
How different between Big Data, Business Intelligence and Analytics ?
Thanakrit Lersmethasakul
 
Business Intelligence
Sukirti Garg
 
Overview of Business Intelligence
Parthiv Dixit
 
Self-Service BI Trends
Netwoven Inc.
 
Big Data Case study - caixa bank
Chungsik Yun
 
Big Data and Semantic Web in Manufacturing
Nitesh Khilwani
 
Instant Analytics with Birst and SAP HANA Cloud Platform for #sitNL
Richard Neale
 
Big Data Analytic with Hadoop: Customer Stories
Yellowfin
 
Spring 2017 Sage 300 (Accpac) Users Group
Gross, Mendelsohn & Associates
 
New Approach to Supply Chain Analytics
demando
 
Big Data LDN 2018: CONNECTING SILOS IN REAL-TIME WITH DATA VIRTUALIZATION
Matt Stubbs
 
The Evolution of Business Intelligence
Call Sumo
 
#MITXData 2014 - Leveraging Self-Service Business Intelligence to Drive Marke...
MITX
 
Tools and techniques for predictive analytics
RohanKumarJumnani
 
Data analytics as a service
Stanley Wang
 
The Present - the History of Business Intelligence
Phocas Software
 
Ad

Viewers also liked (10)

PDF
Malaysia Big Data Analytics Initiative: 2015 Imperatives
Peter Kua
 
PDF
Text visualization - by Jeff Clark
Cindy Xiao
 
PDF
Bi on Big Data - Strata 2016 in London
Dremio Corporation
 
PDF
Bi isn't big data and big data isn't BI (updated)
mark madsen
 
PDF
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BICC Thomas More
 
PDF
How big data is transforming BI
DeZyre
 
PDF
What is bi analytics and big data
galiasisense
 
PPTX
Big Data and BI Best Practices
Yellowfin
 
PDF
Analytics Trends 2016: The next evolution
Deloitte United States
 
PDF
Big Data visualization with Apache Spark and Zeppelin
prajods
 
Malaysia Big Data Analytics Initiative: 2015 Imperatives
Peter Kua
 
Text visualization - by Jeff Clark
Cindy Xiao
 
Bi on Big Data - Strata 2016 in London
Dremio Corporation
 
Bi isn't big data and big data isn't BI (updated)
mark madsen
 
BI congres 2014-5: from BI to big data - Jan Aertsen - Pentaho
BICC Thomas More
 
How big data is transforming BI
DeZyre
 
What is bi analytics and big data
galiasisense
 
Big Data and BI Best Practices
Yellowfin
 
Analytics Trends 2016: The next evolution
Deloitte United States
 
Big Data visualization with Apache Spark and Zeppelin
prajods
 
Ad

Similar to Blueprint for integrating big data analytics and bi (20)

PPTX
Karmasphere bdabi blueprint- final
Abe Taha
 
PPTX
Big data and bi best practices slidedeck
Actian Corporation
 
PDF
Time to Fly - Why Predictive Analytics is Going Mainstream
Inside Analysis
 
PDF
Hadoop and SQL: Delivery Analytics Across the Organization
Seeling Cheung
 
PDF
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
BigMine
 
PPTX
Big data analytics - hadoop
Vishwajeet Jadeja
 
PDF
Hot Technologies of 2013: Hadoop 2.0
Inside Analysis
 
PPTX
Big Data in Azure
DataWorks Summit/Hadoop Summit
 
PPTX
New Innovations in Information Management for Big Data - Smarter Business 2013
IBM Sverige
 
PDF
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Cynthia Saracco
 
PPTX
Bi 4.0 Migration Strategy and Best Practices
Eric Molner
 
PPTX
Anexinet Big Data Solutions
Mark Kromer
 
PPTX
Building a Modern Analytic Database with Cloudera 5.8
Cloudera, Inc.
 
PDF
Create your Big Data vision and Hadoop-ify your data warehouse
Jeff Kelly
 
PPTX
Big data and hadoop
Sri Kanth
 
PPTX
Big data by Mithlesh sadh
Mithlesh Sadh
 
PDF
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
 
PPT
Scaling Data overview
Wade Malone
 
PDF
How to implement Hadoop successfully
Adir Sharabi
 
PPT
Oh! Session on Introduction to BIG Data
Prakalp Agarwal
 
Karmasphere bdabi blueprint- final
Abe Taha
 
Big data and bi best practices slidedeck
Actian Corporation
 
Time to Fly - Why Predictive Analytics is Going Mainstream
Inside Analysis
 
Hadoop and SQL: Delivery Analytics Across the Organization
Seeling Cheung
 
Big Data Analytics: Applications and Opportunities in On-line Predictive Mode...
BigMine
 
Big data analytics - hadoop
Vishwajeet Jadeja
 
Hot Technologies of 2013: Hadoop 2.0
Inside Analysis
 
Big Data in Azure
DataWorks Summit/Hadoop Summit
 
New Innovations in Information Management for Big Data - Smarter Business 2013
IBM Sverige
 
Big Data: InterConnect 2016 Session on Getting Started with Big Data Analytics
Cynthia Saracco
 
Bi 4.0 Migration Strategy and Best Practices
Eric Molner
 
Anexinet Big Data Solutions
Mark Kromer
 
Building a Modern Analytic Database with Cloudera 5.8
Cloudera, Inc.
 
Create your Big Data vision and Hadoop-ify your data warehouse
Jeff Kelly
 
Big data and hadoop
Sri Kanth
 
Big data by Mithlesh sadh
Mithlesh Sadh
 
BAR360 open data platform presentation at DAMA, Sydney
Sai Paravastu
 
Scaling Data overview
Wade Malone
 
How to implement Hadoop successfully
Adir Sharabi
 
Oh! Session on Introduction to BIG Data
Prakalp Agarwal
 

More from DataWorks Summit (20)

PPTX
Data Science Crash Course
DataWorks Summit
 
PPTX
Floating on a RAFT: HBase Durability with Apache Ratis
DataWorks Summit
 
PPTX
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
DataWorks Summit
 
PDF
HBase Tales From the Trenches - Short stories about most common HBase operati...
DataWorks Summit
 
PPTX
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
DataWorks Summit
 
PPTX
Managing the Dewey Decimal System
DataWorks Summit
 
PPTX
Practical NoSQL: Accumulo's dirlist Example
DataWorks Summit
 
PPTX
HBase Global Indexing to support large-scale data ingestion at Uber
DataWorks Summit
 
PPTX
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
DataWorks Summit
 
PPTX
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
DataWorks Summit
 
PPTX
Supporting Apache HBase : Troubleshooting and Supportability Improvements
DataWorks Summit
 
PPTX
Security Framework for Multitenant Architecture
DataWorks Summit
 
PDF
Presto: Optimizing Performance of SQL-on-Anything Engine
DataWorks Summit
 
PPTX
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
DataWorks Summit
 
PPTX
Extending Twitter's Data Platform to Google Cloud
DataWorks Summit
 
PPTX
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
DataWorks Summit
 
PPTX
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
DataWorks Summit
 
PPTX
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
DataWorks Summit
 
PDF
Computer Vision: Coming to a Store Near You
DataWorks Summit
 
PPTX
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
DataWorks Summit
 
Data Science Crash Course
DataWorks Summit
 
Floating on a RAFT: HBase Durability with Apache Ratis
DataWorks Summit
 
Tracking Crime as It Occurs with Apache Phoenix, Apache HBase and Apache NiFi
DataWorks Summit
 
HBase Tales From the Trenches - Short stories about most common HBase operati...
DataWorks Summit
 
Optimizing Geospatial Operations with Server-side Programming in HBase and Ac...
DataWorks Summit
 
Managing the Dewey Decimal System
DataWorks Summit
 
Practical NoSQL: Accumulo's dirlist Example
DataWorks Summit
 
HBase Global Indexing to support large-scale data ingestion at Uber
DataWorks Summit
 
Scaling Cloud-Scale Translytics Workloads with Omid and Phoenix
DataWorks Summit
 
Building the High Speed Cybersecurity Data Pipeline Using Apache NiFi
DataWorks Summit
 
Supporting Apache HBase : Troubleshooting and Supportability Improvements
DataWorks Summit
 
Security Framework for Multitenant Architecture
DataWorks Summit
 
Presto: Optimizing Performance of SQL-on-Anything Engine
DataWorks Summit
 
Introducing MlFlow: An Open Source Platform for the Machine Learning Lifecycl...
DataWorks Summit
 
Extending Twitter's Data Platform to Google Cloud
DataWorks Summit
 
Event-Driven Messaging and Actions using Apache Flink and Apache NiFi
DataWorks Summit
 
Securing Data in Hybrid on-premise and Cloud Environments using Apache Ranger
DataWorks Summit
 
Big Data Meets NVM: Accelerating Big Data Processing with Non-Volatile Memory...
DataWorks Summit
 
Computer Vision: Coming to a Store Near You
DataWorks Summit
 
Big Data Genomics: Clustering Billions of DNA Sequences with Apache Spark
DataWorks Summit
 

Recently uploaded (20)

PDF
Revolutionize Operations with Intelligent IoT Monitoring and Control
Rejig Digital
 
PDF
Doc9.....................................
SofiaCollazos
 
PDF
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
PPT
L2 Rules of Netiquette in Empowerment technology
Archibal2
 
PDF
Why Your AI & Cybersecurity Hiring Still Misses the Mark in 2025
Virtual Employee Pvt. Ltd.
 
PDF
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
PDF
Orbitly Pitch DeckA Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
PPTX
How to Build a Scalable Micro-Investing Platform in 2025 - A Founder’s Guide ...
Third Rock Techkno
 
PDF
Best ERP System for Manufacturing in India | Elite Mindz
Elite Mindz
 
PDF
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
PDF
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PPTX
Comunidade Salesforce SĂŁo Paulo - Desmistificando o Omnistudio (Vlocity)
Francisco Vieira JĂșnior
 
PPTX
Coupa-Overview _Assumptions presentation
annapureddyn
 
PDF
Software Development Methodologies in 2025
KodekX
 
PPT
Coupa-Kickoff-Meeting-Template presentai
annapureddyn
 
PPTX
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
PDF
Chapter 2 Digital Image Fundamentals.pdf
Getnet Tigabie Askale -(GM)
 
PPTX
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
PDF
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 
Revolutionize Operations with Intelligent IoT Monitoring and Control
Rejig Digital
 
Doc9.....................................
SofiaCollazos
 
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
L2 Rules of Netiquette in Empowerment technology
Archibal2
 
Why Your AI & Cybersecurity Hiring Still Misses the Mark in 2025
Virtual Employee Pvt. Ltd.
 
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
Orbitly Pitch DeckA Mission-Driven Platform for Side Project Collaboration (...
zz41354899
 
How to Build a Scalable Micro-Investing Platform in 2025 - A Founder’s Guide ...
Third Rock Techkno
 
Best ERP System for Manufacturing in India | Elite Mindz
Elite Mindz
 
Structs to JSON: How Go Powers REST APIs
Emily Achieng
 
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
Comunidade Salesforce SĂŁo Paulo - Desmistificando o Omnistudio (Vlocity)
Francisco Vieira JĂșnior
 
Coupa-Overview _Assumptions presentation
annapureddyn
 
Software Development Methodologies in 2025
KodekX
 
Coupa-Kickoff-Meeting-Template presentai
annapureddyn
 
cloud computing vai.pptx for the project
vaibhavdobariyal79
 
Chapter 2 Digital Image Fundamentals.pdf
Getnet Tigabie Askale -(GM)
 
OA presentation.pptx OA presentation.pptx
pateldhruv002338
 
Accelerating Oracle Database 23ai Troubleshooting with Oracle AHF Fleet Insig...
Sandesh Rao
 

Blueprint for integrating big data analytics and bi

  • 1. Big Data Insight Blueprint for Integrating Big Data Analytics and BI Abe Taha, VP Engineering abetaha@karmasphere.com www.karmasphere.com
  • 2. Big Data Insight >  Agenda ĂŒïƒŒâ€Ż Where does Big Data Analytics fit in the BI ecosystem ĂŒïƒŒâ€Ż How does Big Data Analytics complement the type of analysis we do today using BI ĂŒïƒŒâ€Ż What are clients doing with Big Data Analytics that they couldn’t do with BI ĂŒïƒŒâ€Ż What do we need to think about to make Hadoop deployments successful 2 Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
  • 3. Big Data Insight >  Hadoop not standing alone
  • 4. Big Data Insight >  Parallel and Complementary Stacks
  • 5. Big Data Insight >  The Best of Both Worlds = Big Data Analytics + Traditional BI Traditional BI Big Data Analytics Purpose Reporting on business Optimizing the business Paradigm Ask a specific question Ask any question Format Look at structured data Look at all data Setup Pre-engineered On-the-fly Data locations Siloed One place Agility Weeks to months Almost Immediate 5 Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
  • 6. Big Data Insight >  Big Data Analytics on Hadoop Use Cases Product Optimization ‱  Insight to usage patterns, bug paths, quality outages ‱  Outline new features, improve product roadmap and process ‱  Enhance customer service, quality and product “stickiness” Unified Customer View ‱  Insight to correlations across product lines and interaction channels ‱  Personalize oïŹ€ers, services and customer experience ‱  Reduce churn and increase customer satisfaction Marketing Performance ‱  Insight to market program attribution and ROI ‱  Increase customer targeting through micro-segmentation ‱  Optimize online ads and cross channel programs 6 © Karmasphere 2012
  • 7. Big Data Insight >  What Hadoop Adopters Are Saying “The kind of new stuïŹ€ we want to do can’t get done with BI“ Large Hi Tech Chip Manufacturer 7 Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
  • 8. Big Data Insight >  How to make Hadoop successful with BI 1.  Employ All Data 2.  Use All Analytic Assets 3.  Provide Self-Service Access for All Users 4.  Build a Collaborative Environment 5.  Be Open and Extensible 6.  Populate Best-of-Breed Reporting Tools
  • 9. Big Data Insight >  Cornerstone 1: Employ All Data ĂŒïƒŒâ€Ż Leave No Data Behind ‱  Raw unstructured – Web logs, machine / sensor data, mobile social, video, etc. ‱  Structured data – traditional RDMBS, EDW’s ‱  Streaming vs. batch oriented ‱  Data governance and quality
  • 10. Big Data Insight >  Cornerstone 2: Use All Analytic Assets ĂŒïƒŒâ€Ż Employ All Analytic Assets ‱  Traditional models and assets ‱  Standard Hadoop components including UDFs and SerDes ‱  Custom algorithms ‱  Models created in other systems such as SAS/R
  • 11. Big Data Insight >  Cornerstone 3: Provide Self-Service Access for All Users ĂŒïƒŒâ€Ż Self-Service ‱  BYOD: Bring Your Own Data ‱  Ingest custom functions and algorithms ‱  Intuitive, no special skill sets required ĂŒïƒŒâ€Ż Empower All Users and Skill Sets ‱  Business User ‱  Easy-to-use ad-hoc analysis, web-based forms ‱  Drag and drop ‱  Data Analysts ‱  Common skills: SQL ‱  Powerful iterative analysis ‱  Analytical models and algorithms ‱  Customers and Partners for ecosystem
  • 12. Big Data Insight >  Cornerstone 4: Build a Collaborative Environment ĂŒïƒŒâ€Ż Collaborative ‱  Project-based environment ‱  Leverage cross-functional skills ‱  Security and isolation ĂŒïƒŒâ€Ż Social ‱  Share data and insights across teams ‱  Metadata, Queries, Results and Visualizations ‱  View colleague’s activities ‱  Usage feedback and metrics
  • 13. Big Data Insight >  Cornerstone 5: Be Open and Extensible ĂŒïƒŒâ€Ż Open ‱  Active community, rapid innovation ‱  Vendor commitment ‱  Standards based ‱  Portable - No vendor lock-in ‱  Expose standard API’s and interfaces ĂŒïƒŒâ€Ż Extensible ‱  Add custom functions ‱  Reuse existing analytic models ‱  Add additional data sources by defining custom parsers
  • 14. Big Data Insight >  Cornerstone 6: Populate Best-of-Breed Reporting Tools ĂŒïƒŒâ€Ż Best-Of-Breed Reporting tools ‱  Ingest data from existing BI systems and ad hoc data including Spreadsheet data ‱  Automate delivery of insights ‱  Push insights to RDBMS, EDW’s and MPP ‱  Expose standards APIs for programmability
  • 15. Big Data Insight >  How would an architecture look 15 Karmasphere Proprietary and Confidential. Do Not Copy. Do Not Distribute
  • 16. Big Data Insight >  Summary 1.  Implement Big Data Analytics and BI co-existence Hadoop at your fingertips 2.  Leverage all your assets 3.  Use and build on open and extensible solutions across your company
 4.  Build social and collaborative in early   Private and Confidential
  • 17. Big Data Insight >  Summary Get the Best of Both Worlds – Build a Bridge Inside Your Company Big Data Analytics on Hadoop Future, see intent Drives Optimization BI Just getting started Historical Drives reporting Entrenched Be around for a long time