Data is not neutral raw material; it is a strategic asset. The entity that collects, stores, analyzes, and governs large, high‑quality data sets gains economic advantage, political influence, and operational control. That concentration of capability — to predict behavior, set markets, shape information flows, and make decisions at scale — is what turns data into power.
Key actors who control data
- Big technology platforms: Companies like global search, social media, cloud, and ecommerce platforms aggregate massive behavioral, transactional, and location data across billions of users and services.
- Governments and regulators: States collect identity, tax, health, telecommunications, and surveillance data; they also set rules that determine who may use what data and how.
- Data brokers and aggregators: Firms that buy, enrich, and resell consumer profiles, often combining public records, purchase history, and inferred attributes for marketing or analytics.
- Enterprises with vertical stacks: Healthcare providers, banks, retailers, and telcos that hold specialized, sensitive datasets linked to real-world outcomes.
- Research institutions and public bodies: Universities and statistical agencies produce and steward scientific, demographic, and environmental data for public benefit.
- Individuals and communities: End users create data by living, consuming, and interacting; collective action and legal frameworks can shift practical control back toward them.
Types of data that confer influence
- Personal identifier data: Names, government IDs, addresses — used for control, authentication, and enforcement.
- Behavioral and interactional data: Search queries, clicks, watch history, social graphs — the raw materials for personalization and persuasion.
- Transactional and financial data: Purchases, pricing, credit records — key to economic profiling and dynamic pricing strategies.
- Sensor and IoT data: Location traces, device telemetry, smart home logs — enable continuous monitoring and context-aware services.
- Biometric and genomic data: Fingerprints, facial data, DNA — uniquely sensitive inputs for identity, health research, and forensic uses.
How exercising authority over data becomes a source of power: underlying mechanisms and resulting impacts
- Economic moat and market power: Large data sets improve machine learning models, which improve products, driving more users and more data — a virtuous cycle that erects barriers to entry. Example: search and ad targeting have concentrated advertising markets because better data yields higher ad relevance and revenue.
- Predictive advantage: Accurate predictions about behavior enable firm decisions that tilt outcomes in their favor: targeted advertising, credit scoring, fraud detection, inventory optimization.
- Behavioral influence and information control: Platforms control what content is amplified or suppressed through recommendation algorithms. The Cambridge Analytica case (where harvested Facebook data was used to target political messaging) exemplifies how behavioral data can be weaponized for persuasion.
- Gatekeeping and platform governance: Owners of dominant platforms can set rules for third parties, controlling market access and terms for competitors — for example, marketplace platforms that combine seller data with platform-owned products gain insights that can disadvantage independent sellers.
- Surveillance and social control: Centralized access to communication, movement, and transactional data enables monitoring at scale. Government programs and private analytic tools can be combined to build predictive policing, eligibility systems, or social scoring mechanisms.
- National security and geopolitical leverage: Nations with advanced digital ecosystems and access to strategic data (telecoms, critical infrastructure telemetry, citizen registries) gain operational intelligence and bargaining power in diplomacy and conflict.
Notable cases and key data insights
- Cambridge Analytica (2016–2018): Harvested Facebook user data to build psychological profiles for highly targeted political advertising, highlighting risks of third‑party access and opaque reuse.
- Platform ad ecosystems: Google and Meta have historically captured major shares of digital advertising by combining search, social, and targeting data to sell precise audiences to advertisers.
- Amazon marketplace dynamics: Amazon uses sales and search data across the platform to optimize its logistics, recommend products, and develop private‑label items — creating conflicts between marketplace operator and sellers.
- Health data partnerships: Consumer genetics companies and health apps have partnered with pharmaceutical firms to accelerate drug discovery, illustrating how aggregated health data can be monetized with both public benefit and commercial profit.
- Regulatory responses: The EU General Data Protection Regulation (implemented 2018) redefined data controller and processor responsibilities and introduced rights like data portability and the right to erasure; Apple’s App Tracking Transparency (2021) changed mobile ad tracking economics by restricting cross‑app IDFA access.
Consequences for markets, democracy, and equity
- Market concentration: Data-driven advantages favor incumbents, reducing competition and slowing innovation in some sectors.
- Privacy erosion and reidentification risk: Even “anonymized” datasets can be reidentified when combined with other sources, exposing sensitive information.
- Discrimination and bias: Models trained on biased data reproduce and scale unfair outcomes in credit, hiring, policing, and healthcare.
- Information manipulation: Targeted messaging informed by granular data can polarize electorates, manipulate attention, and distort public discourse.
- Asymmetric bargaining power: Individuals and small organizations often lack leverage to negotiate fair terms for data use, while data brokers monetize profiles with opaque provenance.
Tools across policy, technology, and governance to restore a balanced distribution of power
- Regulation and antitrust: Binding requirements on data portability, interoperability, and duties for dominant platforms can curb gatekeeper influence, with enforcement actions such as privacy penalties and continuous antitrust investigations targeting major platforms.
- Data minimization and purpose limitation: Collecting only what is essential and demanding explicit, well‑defined purposes helps reduce surveillance exposure and limits unauthorized secondary uses.
- Data portability and open standards: Enabling users to transfer their information across services and adopting uniform APIs lowers switching barriers while stimulating broader market competition.
- Privacy‑preserving technologies: Approaches including federated learning, differential privacy, and secure multi‑party computation make it possible to train models and run analyses without aggregating raw personal information in a single location.
- Data trusts and stewardship models: Independent stewards can oversee sensitive data under fiduciary duties, providing responsible access for research and activities serving the public interest.
- Transparency and auditability: Requiring model interpretability, traceable provenance, and external audits supports the identification of improper use and potential bias.
Actionable guidance for both organizations and individuals
- For organizations: Establish clear data governance structures, chart how information moves across systems, integrate privacy‑by‑design principles, rely on synthetic data or privacy techniques whenever appropriate, and release transparency reports detailing data practices and model effects.
- For individuals: Adjust privacy settings, restrict app permissions, invoke available data rights such as access, deletion, and portability, and choose services committed to minimal data collection and open disclosure.
Data control extends far beyond technical or commercial concerns; it ultimately determines who can shape markets, steer elections, set scientific agendas, and influence daily life. Power accumulates wherever data streams become exclusive, inference tools are centralized, and oversight remains unclear. Restoring balance calls for aligned legal structures, robust technical protections, thoughtful institutional arrangements, and shared cultural expectations that treat data both as an economic asset and as a form of collective social trust.
