Originally written: May 2023 | Published on Gorilla PE Insights: March 2026

 

This article is a public release of an internal partner memo shared in May 2023.

 


 

In May 2023, we set out to establish the first true benchmark of the AI race. At the time, market sentiment was violently split into two camps: one declaring that "the world will never be the same," and the other dismissing it as "just another tech bubble." Our judgment was that both camps were wrong. Yes, the world is fundamentally changing—but the beneficiaries of that change will not be evenly distributed. And no, this is not a bubble.

 

This piece is not merely a verdict on OpenAI as a single company. It is a documented attempt to recognize AI as a structural paradigm shift and to ask where the first definitive tollgate of that shift is being built.

 


 

Part 1. Why Now? — Breaking the Threshold of 60 Years of Research

 

From Neural Networks to ChatGPT: This Didn't Happen Overnight

 

To truly understand ChatGPT, you cannot start in 2022; you have to rewind to the 1960s. The concept of Artificial Neural Networks (ANNs) has existed for decades. However, for a long time, AI cycled through brutal "winters" and brief "springs." The theoretical ideas were there, but three critical components were missing: massive data, raw compute power, and the right architecture.

 

2012: The Resurrection of Deep Learning. At the ImageNet competition, AlexNet—based on Convolutional Neural Networks (CNNs)—crushed traditional methods. It achieved an error rate of 15.3%, less than half of the previous best record (26.2%). It was the exact moment deep learning proved it could approach human-level accuracy in image recognition.

 

2017: The Dawn of the Transformer. Google published the seminal paper, "Attention Is All You Need." This is the 'T' in GPT. The breakthrough was the Attention Mechanism, which processes the relationships between all words in a sentence in parallel. This fundamentally solved the structural limitations of legacy Recurrent Neural Networks (RNNs), which tended to "forget" earlier context in long passages. At the WMT (Workshop on Machine Translation) 2017–2018 competitions, Transformer-based models improved BLEU scores by over 30% compared to legacy RNNs. For the first time, a quantitative leap in linguistic fluency was measurable and proven.

 

2020: GPT-3. A model boasting 175 billion parameters debuted. It was qualitatively different from anything that came before it. Yet, its practical utility for the average user remained limited.

 

Late 2022: ChatGPT. OpenAI added RLHF (Reinforcement Learning from Human Feedback) into the mix. If GPT-3.5 was a "researcher's experimental sandbox," ChatGPT became a "tool anyone could use perfectly on their first try." It hit 10 million users in 40 days, and 100 million in two months. For context, it took Instagram 11 months to hit that same milestone.

 

Bubbles form in a vacuum. ChatGPT didn't. It emerged at the exact intersection of 60 years of foundational research, a decade of deep learning acceleration, the Transformer architectural breakthrough, and RLHF. A technology that shatters a threshold of this magnitude does not simply regress back below it.

 


 

Scaling Laws: The Brutal Physics of Getting Smarter — And the Inevitability of Oligopoly

 

The second key to unlocking the ChatGPT phenomenon is the Scaling Law. Empirically validated in a 2020 paper by OpenAI's Jared Kaplan, Dario Amodei, and others ("Scaling Laws for Neural Language Models"), the premise is ruthlessly simple: A model’s performance scales non-linearly as you simultaneously increase three variables: Parameters, Data, and Compute.

 

To put numbers to it: If you increase a model's parameters by 10x, the required data and compute to train it increase by roughly 1,000x. This is the core asymmetry of the Scaling Laws. DeepMind’s 2022 Chinchilla research further refined this, revealing that for compute-optimal training, the ratio of model parameters to training tokens should be roughly 1:20.

 

Beyond predictable performance gains, the Scaling Laws trigger another phenomenon: Emergence. Formally articulated in a June 2022 paper by Google’s Jason Wei et al. ("Emergent Abilities of Large Language Models"), the concept states: "Abilities that do not exist in smaller models suddenly appear in larger models—and this phenomenon cannot be predicted by simply extrapolating the performance of the smaller models." GPT-3.5 hovered in the bottom 10% of the Uniform Bar Exam. GPT-4 passed in the top 10%. By merely scaling parameters, its mathematical reasoning leaped to the top 1% of the Biology Olympiad. Even its creators couldn't predict that.

 

To train a model at or beyond the GPT-4 level, you need a dataset equivalent in scale to the entire text of the public internet. Globally, there are perhaps four or five organizations that can absorb this cost. In the foundational AI model race, an oligopoly is structurally inevitable.

 


 

Part 2. The First Benchmark of the AI Race — The May 2023 Landscape

 

The Three Conditions for Victory

 

Winning the AI race requires three distinct pillars: a proprietary data flywheel, technological velocity, and a monetization backbone. As of May 2023, OpenAI is the only company that possesses all three simultaneously.

 

The Data Flywheel: The prompts and feedback generated daily by ChatGPT's 100M+ MAUs are fed straight back into model training. More users create more data, more data builds a better model, and a better model attracts even more users.

 

Technological Velocity: It took two years to go from GPT-3 to GPT-4. Compared to GPT-3.5, GPT-4 reduces the likelihood of responding with disallowed content by 82% and increases factual responses by 40%.

 

The Monetization Backbone (Microsoft): Microsoft poured $10B+ into OpenAI, securing roughly a 49% stake. GitHub, Azure, Bing, Office 365—OpenAI’s tech is being aggressively integrated into Microsoft's entire product suite, which commands a 90% global market share in office software. This distribution channel cannot be replicated by money or time alone.

 

The Data Proves It’s Not a Bubble

 

The MIT Study (Noy & Zhang, 2023): In a randomized controlled trial of 453 college-educated professionals, the group using ChatGPT completed tasks 40% faster while producing work of 18% higher quality. Notably, the most significant improvements were seen among those with lower initial writing skills. AI actually narrowed productivity inequality.

 

GitHub Copilot (Microsoft, 2023): In a controlled study of developers, the cohort using Copilot completed a Java web server task 55% faster than the control group.

 

The Competitor Landscape (As of May 2023)

 

Company

Valuation

Key Partnership

Core Strength

Structural Limitation

OpenAI

$29B

Microsoft $10B+ (Full Suite Integration)

Data Flywheel + Distribution

Early stages of pure 

monetization

Google

Public

Internal

Best-in-class Data, Research, TPUs

The Innovator's Dilemma (Search cannibalization)

Anthropic

$4B

Google $300M (Cloud only)

AI Safety & Alignment Research

Disadvantage in Capital & Distribution

Meta

Public

Internal

Dominant Open-Source Ecosystem

Lacks an independent 

direct-revenue AI model

 


 

Part 3. Counterarguments and Responses

 

Hallucinations: The structural limitations of LLMs are very real. However, this does not invalidate the thesis that "AI will replace a significant portion of cognitive labor." The Noy & Zhang study empirically proved that humans partnered with flawed AI still vastly outperform humans working alone.

 

The "$29B Valuation is a Bubble" Argument: Looking at forward revenue projections for 2024, OpenAI's multiple converges to under 30x Forward PSR. Compared to Snowflake trading at 112x or Palantir at 40x shortly after their IPOs, this valuation is highly defensible for a category king.

 

The Limits of Scaling: Concerns that we will soon exhaust all human-generated text on the internet are entirely valid. But paradoxically, as this constraint tightens, the relative advantage of players who already own proprietary, closed-loop data sources will only amplify.

 


 

Conclusion: The Race Has Begun

 

This piece is not a declaration that OpenAI will reign supreme forever. It is a snapshot of the first critical question we asked in May 2023: Is AI a structural paradigm shift, and who currently meets the conditions to clear its first gate?

 

Our judgment is that, as of May 2023, OpenAI was the sole entity checking all three boxes—data flywheel, velocity, and distribution. Google has the firepower to flip the board. If regulatory scrutiny around AI safety intensifies, entirely new layers (like Anthropic) could rise. The very rules of the game may change.

 

But the most important takeaway is simply this: the game has definitively begun.

 


 

[Subsequent Developments] 

 

The AI race converged into a tight oligopoly of foundation model players faster than expected. OpenAI's ARR surpassed $3.4B by 2025; Google Gemini rebounded with surprising strength; and Anthropic rapidly captured enterprise market share. The core question posed in May 2023—whether the AI race would inevitably consolidate into an oligopoly—proved accurate. The internal ranking of that oligopoly remains an ongoing battle.

 


 

[Key Citations]

 

Kaplan et al., "Scaling Laws for Neural Language Models," arXiv, 2020.01

 

Hoffmann et al. (DeepMind), "Chinchilla," arXiv, 2022.03

 

Wei et al. (Google), "Emergent Abilities of Large Language Models," arXiv, 2022.06

 

Vaswani et al., "Attention Is All You Need," NeurIPS, 2017

 

Noy & Zhang, "Experimental Evidence on the Productivity Effects of Generative AI," SSRN, 2023.03

 

This article does not constitute investment advice.