Who Holds the Reins of Data Power?

Who controls data and why that equals power

Data is far from neutral or merely raw; it functions as a strategic resource. The party that gathers, stores, interprets, and oversees extensive, high‑quality datasets secures economic leverage, political sway, and operational authority. That concentrated ability to anticipate behavior, influence markets, guide information flows, and execute large‑scale decisions is what ultimately transforms data into power.

Primary stakeholders responsible for managing data

  • Big technology platforms: Companies spanning global search, social networks, cloud ecosystems, and ecommerce services accumulate vast volumes of behavioral, transactional, and location-based information derived from billions of users and activities.
  • Governments and regulators: States gather identity, taxation, health, telecom, and surveillance records, while also defining the policies that govern how data may be accessed and utilized.
  • Data brokers and aggregators: Businesses that acquire, enhance, and market consumer profiles, frequently merging public documents, purchasing histories, and inferred attributes for marketing or analytics.
  • Enterprises with vertical stacks: Healthcare networks, financial institutions, retail groups, and telecommunications firms maintain specialized and sensitive datasets tied to measurable real‑world outcomes.
  • Research institutions and public bodies: Universities and national statistical offices generate and curate scientific, demographic, and environmental data aimed at serving the public good.
  • Individuals and communities: People produce data through daily activities, consumption, and interactions; coordinated action and regulatory protections can gradually restore meaningful control to them.

Categories of data that grant influence

  • Personal identifier data: Names, government IDs, addresses — used for control, authentication, and enforcement.
  • Behavioral and interactional data: Search queries, clicks, watch history, social graphs — the raw materials for personalization and persuasion.
  • Transactional and financial data: Purchases, pricing, credit records — key to economic profiling and dynamic pricing strategies.
  • Sensor and IoT data: Location traces, device telemetry, smart home logs — enable continuous monitoring and context-aware services.
  • Biometric and genomic data: Fingerprints, facial data, DNA — uniquely sensitive inputs for identity, health research, and forensic uses.

How data control translates into power: mechanisms and effects

  • Economic moat and market power: Large data sets improve machine learning models, which improve products, driving more users and more data — a virtuous cycle that erects barriers to entry. Example: search and ad targeting have concentrated advertising markets because better data yields higher ad relevance and revenue.
  • Predictive advantage: Accurate predictions about behavior enable firm decisions that tilt outcomes in their favor: targeted advertising, credit scoring, fraud detection, inventory optimization.
  • Behavioral influence and information control: Platforms control what content is amplified or suppressed through recommendation algorithms. The Cambridge Analytica case (where harvested Facebook data was used to target political messaging) exemplifies how behavioral data can be weaponized for persuasion.
  • Gatekeeping and platform governance: Owners of dominant platforms can set rules for third parties, controlling market access and terms for competitors — for example, marketplace platforms that combine seller data with platform-owned products gain insights that can disadvantage independent sellers.
  • Surveillance and social control: Centralized access to communication, movement, and transactional data enables monitoring at scale. Government programs and private analytic tools can be combined to build predictive policing, eligibility systems, or social scoring mechanisms.
  • National security and geopolitical leverage: Nations with advanced digital ecosystems and access to strategic data (telecoms, critical infrastructure telemetry, citizen registries) gain operational intelligence and bargaining power in diplomacy and conflict.

Notable cases and key data insights

  • Cambridge Analytica (2016–2018): Harvested Facebook user data to build psychological profiles for highly targeted political advertising, highlighting risks of third‑party access and opaque reuse.
  • Platform ad ecosystems: Google and Meta have historically captured major shares of digital advertising by combining search, social, and targeting data to sell precise audiences to advertisers.
  • Amazon marketplace dynamics: Amazon uses sales and search data across the platform to optimize its logistics, recommend products, and develop private‑label items — creating conflicts between marketplace operator and sellers.
  • Health data partnerships: Consumer genetics companies and health apps have partnered with pharmaceutical firms to accelerate drug discovery, illustrating how aggregated health data can be monetized with both public benefit and commercial profit.
  • Regulatory responses: The EU General Data Protection Regulation (implemented 2018) redefined data controller and processor responsibilities and introduced rights like data portability and the right to erasure; Apple’s App Tracking Transparency (2021) changed mobile ad tracking economics by restricting cross‑app IDFA access.

Consequences for markets, democracy, and equity

  • Market concentration: Data-driven advantages favor incumbents, reducing competition and slowing innovation in some sectors.
  • Privacy erosion and reidentification risk: Even «anonymized» datasets can be reidentified when combined with other sources, exposing sensitive information.
  • Discrimination and bias: Models trained on biased data reproduce and scale unfair outcomes in credit, hiring, policing, and healthcare.
  • Information manipulation: Targeted messaging informed by granular data can polarize electorates, manipulate attention, and distort public discourse.
  • Asymmetric bargaining power: Individuals and small organizations often lack leverage to negotiate fair terms for data use, while data brokers monetize profiles with opaque provenance.

Tools across policy, technology, and governance to restore a balanced distribution of power

  • Regulation and antitrust: Enforceable rules for data portability, interoperability, and dominant platform obligations can reduce gatekeeper power. Enforcement examples include privacy fines and ongoing antitrust scrutiny of major platforms.
  • Data minimization and purpose limitation: Limiting collection to what is necessary and requiring clear, specific purposes reduces surveillance risks and secondary misuse.
  • Data portability and open standards: Allowing consumers to move data between services and using standardized APIs lowers switching costs and encourages competition.
  • Privacy‑preserving technologies: Techniques like federated learning, differential privacy, and secure multi‑party computation enable model training and analytics without centralizing raw personal data.
  • Data trusts and stewardship models: Independent custodians can manage sensitive datasets with fiduciary responsibilities, ensuring ethical access for research and public interest use.
  • Transparency and auditability: Mandating model explanations, provenance records, and third‑party audits helps detect misuse and bias.

Actionable guidance for both organizations and individuals

  • For organizations: Build clear data governance frameworks, map data flows, apply privacy‑by‑design, use synthetic data or privacy techniques when possible, and publish transparency reports about data use and model impacts.
  • For individuals: Use privacy controls, limit permissions, exercise data rights where available (access, deletion, portability), and prefer services that practice minimal collection and transparency.

Data control is not just a technical or commercial issue; it shapes who can influence markets, elections, scientific priorities, and everyday life. Power accrues where data flows are monopolized, where inference capabilities are concentrated, and where governance is opaque. Rebalancing that power requires coordinated legal frameworks, technical safeguards, institutional design, and cultural norms that recognize data as both an economic resource and a collective social trust.

By Jasmin Rodriguez