Published: April 26, 2021

Tech Talk: Growing Role of Big Data

Mike Robey, MS, AAO-HNS/F Senior Director, Information Technology

Harvesting business intelligence from data is nothing new. Decision support systems and executive information systems have been around since the 1980s. What makes Big Data different are the three Vs: volume, velocity, and variety. Today’s datasets are huge. Volumes in the terabyte (2⁴⁰ bytes) and petabyte (2⁵⁰ bytes) range are common. Multiple datasets, from internal and external sources, are needed for complex analysis. Combined with the velocity at which new data is being created, these volumes are more than a traditional organization-specific computing environment can keep up with. To further complicate things, today’s data is not just transactional. Metadata, monitoring systems, documents, online postings, and other unstructured data add to the variety of data produced today. Big Data is the field that includes the repositories that support analysis of these huge, disparate sources. This article introduces the major components to give you a better understanding.
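To put the volume figures in perspective, here is a minimal Python sketch of the arithmetic. It assumes the binary (power-of-two) definitions of terabyte and petabyte used above; the helper name human_readable is made up for illustration.

```python
# Binary definitions: 1 terabyte = 2**40 bytes, 1 petabyte = 2**50 bytes.
TERABYTE = 2 ** 40
PETABYTE = 2 ** 50


def human_readable(num_bytes: int) -> str:
    """Format a byte count using binary (power-of-two) units."""
    units = ["B", "KB", "MB", "GB", "TB", "PB"]
    value = float(num_bytes)
    for unit in units:
        # Stop at the first unit that keeps the number below 1024,
        # or at the largest unit we know about.
        if value < 1024 or unit == units[-1]:
            return f"{value:.1f} {unit}"
        value /= 1024


print(TERABYTE)              # 1099511627776
print(PETABYTE)              # 1125899906842624
print(PETABYTE // TERABYTE)  # 1024 terabytes in one petabyte
print(human_readable(3 * TERABYTE))  # 3.0 TB
```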

Where Is Big Data Stored?

Two terms you may have heard are data warehouses and data lakes (see sidebar for definitions). Both have one thing in common: They are repositories separate from the source systems. Having separate databases for business intelligence and reporting is not new. The rationale is to not hinder the ability of the source system to record transactions. Relational databases, consisting of rows and columns, are excellent structures to support transactions. They are not the most efficient for multidimensional reporting, such as time series. Multidimensional analytical data structures were developed to support complex business intelligence reporting. These were fed from disparate source systems so as not to hinder the integrity or the ability of the source systems to do their intended job.
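As a rough illustration of the difference between transactional rows and a multidimensional structure, the sketch below rolls a handful of rows up into a small department-by-month "cube" and reads a time series out of it. The rows, column meanings, and the build_cube helper are all hypothetical; real analytical stores are far more elaborate.

```python
from collections import defaultdict

# Hypothetical transactional rows: (month, department, amount).
transactions = [
    ("2021-01", "Clinic", 120.0),
    ("2021-01", "Surgery", 300.0),
    ("2021-02", "Clinic", 150.0),
    ("2021-01", "Clinic", 80.0),
    ("2021-02", "Surgery", 310.0),
]


def build_cube(rows):
    """Aggregate rows into a two-dimensional structure: department x month."""
    cube = defaultdict(lambda: defaultdict(float))
    for month, department, amount in rows:
        cube[department][month] += amount
    return cube


cube = build_cube(transactions)

# A time series for one department is now a simple lookup by month,
# with no need to scan every transaction again.
clinic_series = [cube["Clinic"][month] for month in sorted(cube["Clinic"])]
print(clinic_series)  # [200.0, 150.0]
```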

As the figure illustrates, loading these central repositories is a three-step process: Extract, Transform, and Load (ETL). The extract phase identifies the source systems where data will be pulled. The source systems may be internal, such as accounting, human resources, and customer relations. External systems, such as census, other government data, or resource data, may also be included. Pertinent data is then harvested from the source systems and transformed in the second phase. Once transformed, data is then loaded into the repository.
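The three ETL phases described above can be sketched in a few lines of Python. This is an illustration only, under stated assumptions: the two source systems are stood in for by small in-memory lists with made-up values, the repository is an in-memory SQLite database, and the table name member_facts is invented for the example.

```python
import sqlite3

# Hypothetical source systems with made-up values.
ACCOUNTING = [
    {"member_id": "001", "dues_paid": "450.00", "state": "va"},
    {"member_id": "002", "dues_paid": "450.00", "state": " md"},
]
EXTERNAL_REFERENCE = [{"state": "VA", "population": 1000}]


def extract():
    """Extract: pull raw records from each source system."""
    return list(ACCOUNTING), list(EXTERNAL_REFERENCE)


def transform(accounting, reference):
    """Transform: fix data types, normalize codes, and combine the sources."""
    population = {row["state"].upper(): row["population"] for row in reference}
    rows = []
    for record in accounting:
        state = record["state"].strip().upper()
        rows.append(
            (record["member_id"], float(record["dues_paid"]), state, population.get(state))
        )
    return rows


def load(rows, conn):
    """Load: write the transformed rows into the central repository."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS member_facts "
        "(member_id TEXT, dues_paid REAL, state TEXT, state_population INTEGER)"
    )
    conn.executemany("INSERT INTO member_facts VALUES (?, ?, ?, ?)", rows)
    conn.commit()


conn = sqlite3.connect(":memory:")
load(transform(*extract()), conn)
loaded = conn.execute("SELECT * FROM member_facts ORDER BY member_id").fetchall()
print(loaded)
```

Note that the source systems are only read during the extract step; all reshaping happens on the copies, which is the point of keeping the repository separate.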

ETL is managed by an overall data governance policy, which defines the lifecycle of the data and various implemented controls governing availability, usability, consistency, data integrity, and data security. Included in data governance are other documents such as Master Data Management, which provides a common definition for the reference data found across the different source systems. The Data Dictionary defines the data elements to be extracted and their transformed layout for loading into the data warehouse. Data privacy ensures that personally identifiable information (PII) is properly protected.
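One way to picture a Data Dictionary at work is as a lookup table that the transform step consults for every field. The sketch below is hypothetical throughout: the field names, the DATA_DICTIONARY layout, and the choice to replace PII with a SHA-256 digest are assumptions made for illustration, and a plain unsalted hash is not adequate privacy protection by itself.

```python
import hashlib

# Hypothetical data dictionary: source field -> target name, type, PII flag.
DATA_DICTIONARY = {
    "MemberID": {"target": "member_id", "type": str, "pii": False},
    "Email": {"target": "email_hash", "type": str, "pii": True},
    "DuesPaid": {"target": "dues_paid", "type": float, "pii": False},
}


def pseudonymize(value: str) -> str:
    """Replace a PII value with a SHA-256 digest (illustration only)."""
    return hashlib.sha256(value.encode("utf-8")).hexdigest()


def apply_dictionary(record: dict) -> dict:
    """Keep only the fields the dictionary defines, converted and renamed."""
    out = {}
    for source_name, rule in DATA_DICTIONARY.items():
        value = rule["type"](record[source_name])
        if rule["pii"]:
            value = pseudonymize(value)
        out[rule["target"]] = value
    return out


raw = {"MemberID": "001", "Email": "member@example.org", "DuesPaid": "450.00", "Notes": "n/a"}
clean = apply_dictionary(raw)
print(sorted(clean))  # ['dues_paid', 'email_hash', 'member_id']
```

Fields that the dictionary does not list, such as Notes here, never reach the warehouse.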

With the data layer covered, let us discuss the data scientist who will be doing the analytics. Harkening back to an undergraduate class in econometrics, any analysis begins with a question or hypothesis to be tested. The iterative methodology is defined by the following steps:

Develop a hypothesis
Refine the hypothesis into a mathematical model
Align data elements to the model’s variables
Check the model's adequacy; run statistical analysis to ensure variables are independent
Test the hypothesis against the derived model
Use the model for prediction and forecasting
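The steps above can be walked through with a deliberately tiny example. The sketch assumes a hypothesis that an outcome y rises linearly with a driver x, uses made-up numbers, and fits the line with ordinary least squares written out by hand; a working data scientist would use a statistics package and far more data.

```python
import math

# Steps 1-2: hypothesis "y rises with x", refined to the model y = a + b * x.
# Step 3: align (made-up) data elements to the model's variables.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [3.1, 4.9, 7.2, 8.8, 11.0]
x_candidate = [2.1, 3.9, 6.2, 8.1, 9.8]  # a second variable we might add


def mean(values):
    return sum(values) / len(values)


def pearson(a, b):
    """Pearson correlation coefficient between two equal-length lists."""
    ma, mb = mean(a), mean(b)
    cov = sum((ai - ma) * (bi - mb) for ai, bi in zip(a, b))
    var_a = sum((ai - ma) ** 2 for ai in a)
    var_b = sum((bi - mb) ** 2 for bi in b)
    return cov / math.sqrt(var_a * var_b)


# Step 4: adequacy check. Two highly correlated explanatory variables
# are not independent, so we keep only x in the model.
print(round(pearson(x, x_candidate), 3))

# Fit the model by ordinary least squares.
mx, my = mean(x), mean(y)
slope = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y)) / sum((xi - mx) ** 2 for xi in x)
intercept = my - slope * mx

# Step 5: test the hypothesis; R-squared near 1 means the line explains the data well.
ss_res = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
ss_tot = sum((yi - my) ** 2 for yi in y)
r_squared = 1 - ss_res / ss_tot


# Step 6: use the model for prediction.
def predict(new_x):
    return intercept + slope * new_x


print(round(slope, 2), round(intercept, 2), round(predict(6.0), 2))
```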

To be effective, a data scientist needs these skills:

Computer science: Knows how to write code and understands databases and Big Data architectures
Advanced math skills: Trigonometry and geometry skills (many of the algorithms used to identify proximity of like items are nonlinear)
Quantitative analyst: Statistics background, visual analytics, experience with unstructured data
Scientist: Evidence-based decision making; an understanding of the scientific method
Strong communications skills: Ability to frame the topic for understanding
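The geometry mentioned in the skills list shows up as soon as you ask which items are "close" to each other. As a minimal sketch, assuming each item is described by two made-up numeric features and that straight-line (Euclidean) distance is a sensible measure of likeness, the hypothetical nearest helper below finds the most similar item:

```python
import math

# Hypothetical items described by two numeric features each.
items = {
    "A": (1.0, 1.0),
    "B": (1.2, 0.9),
    "C": (5.0, 5.0),
}


def nearest(name, catalog):
    """Return the other item with the smallest Euclidean distance to `name`."""
    return min(
        (other for other in catalog if other != name),
        key=lambda other: math.dist(catalog[name], catalog[other]),
    )


print(nearest("A", items))  # B
print(nearest("C", items))  # B
```

Euclidean distance involves squares and a square root, which is one simple example of the nonlinear math behind proximity algorithms.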

Now with the data layer identified and the data scientist introduced, let us talk about outcomes. Big Data analytical activities can be organized into two broad categories: an inward focus and an outward focus. Inward focus on topical areas, such as cost reduction, decision improvement, and improvements in products and services, is nothing new. However, the three Vs of Big Data can help improve products and services. For example, with the onset of the Internet-of-Things, appliances and automobiles report service issues back to manufacturers that then use this data to improve products and services. An outward focus on opportunities, changes, and threats is also supported by Big Data.

You can think of Big Data as the third wave of the Industrial Revolution, with each wave defined by the main source of energy: steam → electricity → data. Labor becomes more specialized with each new wave. Similarly, you can think of the last 50 years of the Computer Age defined by computing → networking → Big Data. The computing phase started with the mainframe computer and evolved to include personal computers and now smartphones. The networking phase began with local area networks in the 1980s and the internet in the 1990s. The computing and networking phases provided the technology to support parallel processing and huge distributed datasets. Equally important, organizations are now enabled to think more broadly than simply recording transactions.

Finding skilled data scientists is key to the effective use of Big Data for focused analytical activities. But so, too, is finding the right balance between applying the scientific method (hypothesis → experimentation → customer reaction observation → adjustment) for growth opportunities and maintaining daily operations.

Definitions

Data lake: A repository where data is stored in its raw format

Data warehouse: A repository for integrated data from multiple source systems

Data swamp: Unmanaged data that no longer provides value
