Get Started. It's Free
or sign up with your email address
BAA by Mind Map: BAA

1. Starting Points

1.1. How do we define "big data"?

1.1.1. By the nature of the data... Classic IBM Definition Other Perspectives Bandwidth of the Senses Singularity Theory

1.1.2. By the nature of the solutions... "Machine Learning" Supervised Learning for Prediction Supervised Learning for Classification Unsupervised Learning for Clustering "Autonomous Systems" Autonomous Systems = Sensing + Learning + Acting

1.1.3. By the nature of the technology... Big Data Technology Landscape Distinctive Elements "Wrangling" - the way data is moved and shaped "Platforms" - the way data is stored, organised, managed, and accessed "Data Science" - the way data is used to generate insights and signals

1.1.4. The reality (an opinion)... The definition of "Big Data" is dependent on context

1.2. What are the objectives of this course?

1.2.1. "...Gain an overview of business applications of big data and analytics techniques..." "Use Cases" "Techniques"

1.2.2. "...Gain real-world insights into various applications of big data analytics and how it can be used to fuel better decision-making within an organisation/business..." "Catalogue" "Challenges"

1.2.3. "...Learn how big data and analytics techniques can create business value..." "Business Case"

1.2.4. "...Understand how to manage big data and analytics projects and teams..." "Technology" "People" "Process"

1.3. How are we going to approach the material?

1.3.1. A catalogue view of "Use Cases"

1.3.2. A thematic dissection of "Use Cases"

2. D. Techniques

2.1. Understanding what analytics can do...

2.1.1. The Analytics Curve Perspective: Describe, Diagnose, Predict, Prescribe

2.1.2. Machine Learning Methods Illustration: Machine Learning Family Tree Principal Types Supervised Learning Unsupervised Learning Reinforcement Learning

2.2. Tasks

2.2.1. Review the United Airlines case, what machine learning technique were they likely to be using?

2.2.2. Team Boards Team #1 Team #2 Team #3

3. E. Technology

3.1. Technology is a critical enabler of big data solutions...

3.1.1. Big Data Technology Landscape

3.1.2. Principal Technology Challenges ...moving massive volumes of high speed, high variety data from source to the "right place, at the right time, in the right shape"... "Wrangling" ...storing, architecting, and managing massive volumes of high variety data, whilst ensuring it is ready to use... "Platforms" ...driving insights from data assets in a form and on a schedule that is useful... "Data Science"

3.1.3. Pipeline Architecture Big data solutions typically take the form of a pipeline of common components... Illustration: Iguazio Perspective: Principal Pipeline Components Cloud Implementations

3.1.4. The Apache Stack Tools from the Apache Foundation are dominant in the big data world... Illustration: Apache Pipeline "Killer Apps" of the Apache Stack Illustration: Wider Apache Big Data Projects

3.1.5. An Emerging Class ...technologies responding to the choking of data availability resulting from privacy and data monetisation trends... Context Perspectives: Secure Sharing Platforms, Safe Spaces, and Sandboxes Example: Google Ads Data Hub

3.1.6. Tasks Validate Stacks 1. Go to 2. Browse Stacks 3. Select a Big Data Candidate 4. Consider Their Configuration Build a stack in Target Stack 1. Create a Free Account in 2. Create New Blueprint 3. Compile Components 4. Join Arrows from User to Source Team Boards Team #1 Team #2 Team #3

4. F. Process

4.1. What the human brings to the picture...

4.1.1. Perspective: Translation

4.1.2. Component Processes and Patterns Aligning the technical solution to business objectives... Illustration: Tibco CRISP-DM Selecting the right data science technique... Illustration: Machine Learning Model Selection Managing an exploratory approach... Illustration: OODA Training, testing, and iterating machine learning models... Illustration: Towards Data Science Illustration: Automated Machine Learning ...and adopting the learning pattern... Positioning the learning process... the model going to learn and act "online", or will be trained in a sandbox and "batch" insights released? Maintaining model quality... Illustration: Overfitting Avoidance Illustration: Balancing Bias

4.1.3. Hybrid Models Illustration: Uber Michaelangelo Illustration: Particle Data

4.1.4. Tasks Review and discuss the processes implicit in the Grab Catwalk case... Grab Catwalk Team Boards Team #1 Team #2 Team #3

5. G. People

5.1. Despite all the "tech", success and failure in big data still hinges on people...

5.1.1. Training Big Data Talent ...the challenge of maintaining skills relevance... Illustration: Stack "Fingerprints" Illustration: Data Science Unicorns Skills Fingerprints

5.1.2. Organising Big Data Talent Blending Hubs and Markets Perspective: Internal Markets Perspective: CoE Capabilities Illustration: CoE Positioning Perspective: Specifying a CoE Illustration: Uber Michaelangelo "...Successfully scaling ML at a company like Uber requires getting much more than just the technology right—there are important considerations for organization and process design as well. In this section, we look at critical success factors across three pillars: organization, process, as well as technology..."

5.1.3. Tasks Brainstorm the principal "Purple Challenges" an organisation aspiring to big data leadership might face... Team Boards Team #1 Team #2 Team #3

6. User Notes

6.1. Mindmeister

6.1.1. Single source for materials

6.1.2. Public link stays public

6.1.3. Take a personal copy if you want to keep!

6.2. Google Jamboards

6.2.1. For collaboration

6.2.2. Take a personal copy if you want to keep!

7. H. Challenges

7.1. Big data analytics initiatives routinely fail, why?...

7.1.1. Framing: McKinsey's 10 Flags 1. The executive team doesn’t have a clear vision for its advanced-analytics programs Illustration: Gartner Hype Cycle Illustration: Growth of the Dataverse 2. No one has determined the value that the initial use cases can deliver in the first year Illustration: McKinsey Prioritisation 3. There’s no analytics strategy beyond a few use cases Illustration: Play to Win Choice Cascade 4. Analytics roles - present and future - are poorly defined Illustration: Data Science Unicorns Illustration: Uber Michaelangelo 5. The organization lacks analytics translators Illustration: Purple People 6. Analytics capabilities are isolated from the business, resulting in an ineffective analytics organization structure Illustration: Internal Markets Illustration: CoE Positioning 7. Costly data-cleansing efforts are started en masse Illustration: Tibco CRISP-DM 8. Analytics platforms aren’t built to purpose Illustration: Apache Pipeline 9. Nobody knows the quantitative impact that analytics is providing Recommended: Case Making Map 10. No one is hyperfocused on identifying potential ethical, social, and regulatory implications of analytics initiatives Illustration: Singapore MAS "FEAT" Illustration: United Airlines

7.1.2. Tasks Discuss the FEAT potential exposure of Apple in the credit card case... Exhibit: Apple Credit Card Case Team Boards Team #1 Team #2 Team #3

8. C. Business Case

8.1. Forming a language around the benefits of Big Data...

8.1.1. Impact Assessment and Communication Tools Illustration: Play to Win Choice Cascade Illustration: Dataiku Classification of Returns Illustration: McKinsey Top Bottom Line Benefits Extension: McKinsey Prioritisation Illustration: Decomposition with Impact Maps

8.1.2. Recommended: Case Making Map

8.1.3. Tasks Use the case-making map to frame a business case underpinning a "Business Intelligence" (dashboard) investment... Team Boards Team #1 Team #2 Team #3

9. B. Catalogue

9.1. Optimise Action in a System

9.1.1. "Optimise" "Pattern, Prediction, Prescription" Illustration: The Analytics Curve

9.1.2. "System" Human Personalisation Machine Operational Optimisation Mixed Economics and Finance Sports Commercial Environmental

9.1.3. "Action" Decision Support Example: Dashboards System Configurations Example: Uber Example: IFTTT Autonomous Systems Example: Level 5 Autonomous Vehicles Example: Autonomous Service Bots Example: Intelligence Process Automation

10. A. Use Cases

10.1. A canvas technique for design and communication...

10.1.1. Nature(s) Type 1. A strategic narrative... Play to Win Type 2. A process architecture... Uber Use Case Diagram Type 3. A domains checklist... Big Data Management Canvas

10.1.2. Recommended: Use Case Template

10.1.3. Tasks Translate the German Rail example onto a "Use Case" template... Team Boards Team #1 Team #2 Team #3