Orply.

AI Will Commoditize Legal Work Product, Not Legal Judgment

Harvey co-founder and chief executive Winston Weinberg argues that AI will commoditize much of the routine work product in law while increasing the value of judgment at the point where legal decisions are made. In a Knowledge Project interview with Shane Parrish, Weinberg describes how Harvey grew from a GPT-3 test on landlord-tenant questions into an $11bn legal AI company, and explains the operating discipline behind it: faster decisions, sharper prioritization, and a team built to withstand repeated failure.

Harvey’s bet is that legal work changes around the decision point

Winston Weinberg describes Harvey as a system that lets lawyers use frontier models to do legal work “in a much better way and much more streamlined way.” The broader ambition is not merely to sell software into law firms. It is to track the progress of AI models and apply that progress to a profession whose processes, staffing, and service delivery may change as the models improve.

Weinberg’s shorthand is that Harvey is trying to build something like a “legal brain.” In medicine, he said, the analogous effort would be a “medical brain.” But the phrase matters less as branding than as a description of the design problem: deciding where a model should do the work, where a human should review it, and how legal departments and law firms should reorganize the delivery of services once that division changes.

That distinction runs through his view of the profession. He separates professional-services work into two categories: work product, and decision or advice. The first category includes things like contract drafting, large-scale contract review, and other tasks whose value historically came from producing, checking, or organizing the material that supported a decision. Weinberg expects that category to be commoditized.

“The reviewing of those contracts is going to get automated, for sure,” he said. “The decision at the top is not.”

The reason, in his view, is that high-end legal value already sits at the point where experience, context, judgment, and interpersonal knowledge matter. In a large M&A transaction, for example, the best lawyers are not differentiated primarily because they can draft a stock purchase agreement or work through a diligence room faster than everyone else. They know the principals, understand the real commercial interests behind stated positions, and can tell when nine demands are smokescreens and one is the issue that actually matters.

In litigation, Weinberg made a similar point about trial lawyers. AI may improve the ability to test arguments, use data, and generate alternatives. But the lawyer who can read a courtroom, understand the jury, and choose the story that will land is still exercising a human form of judgment. His claim is not that AI makes legal excellence irrelevant. It is that AI strips away more of the surrounding labor, causing the premium to accrue more heavily to the real decision points.

That is why his advice to law students is not to abandon the traditional first-year legal curriculum. He thinks the first year of law school remains valuable precisely because it teaches critical thinking, argument, research, and analytical habits, even when many of the old cases are no longer “good law.” The second and third years, by contrast, need much more hands-on experience in his view. Students should learn clients, industries, and the practical settings in which legal judgment is exercised.

“If you think that your competitive advantage is you are the best at writing briefs, or you are the fastest at doing research or something like that, that was never your competitive advantage,” he said. The better advantage is understanding clients, industries, negotiation dynamics, and the decision context around legal work. As Weinberg put it, “The value is going to accrue to those decision points instead of all the work that's being done.”

That is also where he sees AI increasing the returns to small differences in skill. He compared it to sports: a player who is only slightly faster repeatedly gets to the ball earlier, raises a hand a little higher, or interrupts a pass just in time. Small advantages compound into repeated wins. Knowledge work, he argued, is beginning to behave the same way. If an attorney is slightly better or faster, that attorney may attract a disproportionate share of the work.

Law firms already have a version of this in rainmaker partners, whose advantages may be incremental across many dimensions but compound into a large share of client demand. AI could make that pattern more visible and more extreme. It may also pressure lockstep institutions to change. In many law firms, associates progress at roughly the same pace until the partnership decision years later. Weinberg thinks firms will have to identify when a junior associate is better at a specific skill and promote faster rather than wait for seniority to catch up.

The same logic informs how Harvey promotes internally. Weinberg said the company hires executives who have “been there, done that,” but also promotes people with high raw talent whose slope may exceed that of a more senior external hire. The important question becomes not just what a person knows now, but how fast they learn, adapt, and compound.

The company began with a test lawyers could not easily dismiss

The original Harvey insight did not come from a broad market thesis. It came from a small test of whether GPT-3 could produce legal answers that real lawyers would be willing to send.

Weinberg met his co-founder, Gabe, while Gabe was working in AI research. Gabe had been in the first Google Brain class and later worked at Meta, where he was trying to push large language models. Weinberg, then a lawyer, asked him to show what an LLM was. The available tool was the public GPT-3 API in early 2022. Weinberg first used it to help run a Dungeons and Dragons game, where he found it surprisingly effective. Then he used it on a pro bono landlord-tenant matter in California, feeding it fact patterns and statutes and asking it to apply one to the other.

The outputs were good enough that he and Gabe designed a rough validation exercise. They gathered landlord-tenant questions from r/legaladvice, a subreddit where users ask legal questions that often end with some version of “can I sue somebody?” They ran the questions through a chain-of-thought prompt and gave the answers to three landlord-tenant lawyers. They did not tell the lawyers the answers were AI-generated. They framed each as an answer from a lawyer and asked whether the lawyer would send it with zero edits.

On 86 out of 100 questions, all three attorneys said yes.

86/100
landlord-tenant answers that all three reviewing attorneys said they would send with zero edits

That was Weinberg’s “oh my god” moment. The result was enough to convince him and his co-founder that the models were already materially useful in law and would become more so with even modest improvement.

They emailed the results to Sam Altman and Jason Kwon at OpenAI. The body of the email, Weinberg said, was essentially: “Did you know that the models were this good at legal?” They met first with Kwon, showed the Reddit process and prompts, then met OpenAI’s leadership team on July 4 to pitch the company. OpenAI became the original investor. Weinberg said they did not go to other venture capital firms at the time; he was not from tech, and the case was built around the experiment itself.

The challenge after that was less theoretical viability than attention. Lawyers were busy and skeptical. Early demos sometimes failed because lawyers simply did not care. Weinberg said he would show what he thought at the time looked like “AGI itself,” and the lawyers would look at their phones.

So he changed the demo. As a former litigator, he used public filings in federal cases. He would find a lawyer’s recent brief, load it into Harvey, and ask the system what was strong about the argument, how to argue against it, or how to poke holes in a contention. The difference was immediate. A generic AI demo was easy to ignore. An AI critique of the brief the lawyer had filed last week was not.

The approach was risky because hallucinations could kill the meeting. If Harvey got something wrong, the lawyer could write it off as useless and leave. But when the system worked on the lawyer’s own work, it created the felt sense of what the technology could do. Weinberg said personalization mattered because people often do not understand the product until they feel it on something they personally worked on.

Harvey used an even riskier version of that strategy during its Series A pitch to Sequoia. Sequoia brought in lawyers they knew. Weinberg selected a use case, found an argument the lawyers had made, and used Harvey to analyze the brief. Pat Grady later emailed him asking whether that was the normal demo because it had gotten the lawyers “really riled up.” In that case, the analysis was good and the strategy worked.

The early skepticism Weinberg encountered has not disappeared. He said some people tried ChatGPT or Harvey years ago, saw a hallucination, and still anchor to that first impression. Harvey has had prospects who trialed the product three years earlier, concluded it was not good enough, and returned later to find it unrecognizable. Weinberg thinks non-technical industries are not accustomed to the rate of change in AI, where every release can be a “massive step change.” That makes it hard for buyers to build mental models that are resilient to rapid improvement.

The future law firm may use fewer people per matter and still handle more work

Shane Parrish pressed Weinberg on whether law firms will become smaller as AI systems and agents take over more work. Weinberg resisted the simple answer. Law firms will need fewer people per project, he said, but they may handle many more projects. AI could also let firms institutionalize knowledge faster and scale across matters in ways they could not before.

His five-year guess is that agents will run a large portion of legal work, while humans become better at reviewing agent outputs and identifying where agents may have made mistakes. That resembles part of what senior lawyers already do: review, spot risk, and apply judgment rather than mechanically perform every task.

The demand side may also expand. Weinberg expects significant regulation around AI use, comparable in spirit to audit restrictions in banking. Someone will have to monitor, interpret, and administer those rules, and he expects much of that burden to fall on legal departments. If companies use AI to generate and negotiate contracts, the resulting data rooms may become vastly larger. He suggested a future in which agents create and negotiate documents with each other, producing a level of transactional complexity no human can absorb alone.

In that environment, the human-only lawyer is not replaced by an AI-only system so much as made insufficient. “It has to be an agent plus a human,” he said, because otherwise the complexity cannot be managed.

He used email as an analogy for how technology changes expectations. Email did not merely make existing legal communication faster; it reset the client’s expectation for how quickly work product should come back. AI, in his view, will do the same at a higher level. A firm that can close a deal in 48 hours will have an advantage over one that takes three weeks, especially when delay creates risks such as leaks, cold feet, or stock-price movement.

Quality in this context means both faster and more accurate. But Weinberg warned against analyzing legal AI in a vacuum. Top law firms draw substantial revenue from M&A and other transactional work, which is heavily cyclical. The question is not only whether LLMs can do legal work today. It is what legal work will become when clients themselves are operating with AI. If Meta can ship 50 times as many products daily, the product-review function for lawyers changes as well. More automation on the client side may produce more legal surface area, not less.

An all-AI law firm, however, runs into legal constraints. In the United States, Weinberg said, two rules block the simple version of calling an AI system, giving it facts, and having it file briefs and provide advice. The first is the unauthorized practice of law, which operates state by state and can be a felony. The second is an ABA ethical rule restricting non-lawyer investment in law firms. Together, he said, these rules make the model impossible except in Arizona and Utah, where regulatory sandboxes have changed the rules.

Disclosure obligations around AI use are also unsettled and depend heavily on jurisdiction and client. Weinberg described a rapid shift in client expectations. In 2023, many clients said not to use AI at all, or to disclose exactly how it was being used in small use cases. In 2024, clients began allowing AI by matter type: acceptable for some work, prohibited for other work. By 2025, the posture in some cases had become: you need to use it, you need to explain how, and you need to show how it saves money.

That last point leads to a practical tension. Some clients are not seeing legal bills fall even when their firms use tools like Harvey. Weinberg gave two explanations. First, current tools often automate tasks rather than entire workflows. If AI helps with pieces of diligence but does not automate the entire diligence process, it can be hard to calculate exactly how much time was saved and how that should flow through to pricing. He expects that to change quickly as workflow automation improves.

Second, in-house teams have lagged law firms in adoption, often by about a year. Without direct experience using the tools, they have less visibility into what law firms could automate. Weinberg thinks more communication is needed between in-house legal teams and outside counsel. Harvey’s product direction, as he described it, is partly about building a collaborative environment where the in-house team, the law firm, and AI work together: the client handles what it wants internally, the firm applies expert data and judgment, and AI helps both sides coordinate and execute.

Weinberg’s operating system starts with one document and one bottleneck

Winston Weinberg centers his management system on a Google document called “the list.” It is large — Parrish referred to it as a 200-page or 400-page document — but Weinberg’s description is simple. The top contains a few ideas he wants to remember, including motivational principles and reminders about prioritization. Beneath that are the top three documents he cares about tracking at any moment. If he is worried about revenue, there may be a revenue tracker. If he is worried about post-sales or customer service, there will be a document with stats on that area.

Then come the three goals for the quarter. Usually, he said, they include one hire, one product feature, and one major area of the company that needs to be fixed. Harvey is now shipping about four new products per quarter, but he still identifies the one feature he personally cares about most or needs to focus on most.

The most important part is the daily list. Every day, Weinberg ranks everything he needs to do, refreshes it, crosses items out, and reranks. The value is not the list itself but the repeated act of thinking about what he is doing. He described it as “meta thought about my thoughts.” The more times he clicks into the document during the day and reranks an item, the better his prioritization becomes. His schedule improves. His performance improves. One item goes into first place, often in bold, and everything else gets ignored.

That system is paired with an increasingly aggressive willingness to say no. Weinberg said he now says no to most things, and more things as the company grows. Early on, he said yes to almost everything. When he gets bad at filtering meetings, he asks his chief of staff to force him to write a full paragraph explaining why he should take a meeting. For 99% of meetings or events, the test fails in the first sentence. If he does not want to write the paragraph and can already feel that it is a waste of time, the meeting probably is. For genuinely important things, he said, he could write 20 pages.

This is not framed as productivity hygiene. It is a way to preserve founder attention for the main constraint in the business. Weinberg’s model of a good founder has two parts. First, the founder builds the machine: hiring the right people, creating processes, building product, and establishing the operating structure. Second, once the machine exists, the founder focuses relentlessly on the main bottleneck in the machine.

That means living in the painful parts of the business. If an area is running well, Weinberg tries to ignore it entirely. He focuses on what is burning, and ideally the number one thing that is burning. “If something's going well at the company,” he said, “you are not going to be working on it.”

The difficulty is that the right thing to do often looks bad in the short term. Weinberg said people struggle to say no because yes often creates visible, immediate progress that outsiders reward. If an investor says revenue problems mean the company needs a chief revenue officer, a founder can take meetings with senior candidates and receive pats on the back for “progress.” But if the founder knows the real revenue problem is product quality, the better choice may be to ignore those meetings and fix the product — a path that may look worse for two quarters before it works.

That requires tolerating the period in which everyone thinks things are deteriorating. Weinberg sees this as a recurring pattern among strong founders: the outside world says to go fix one thing, and they put blinders on to fix the harder underlying problem. Sometimes they are vindicated in six months; sometimes a year; sometimes much longer. The hard part is accepting no short-term validation while making the slower decision.

His formal decision principles follow from that view. First, determine immediately whether a decision is a one-way door or a two-way door. His default is that 99.9% of decisions are two-way doors. Second, return to the company’s P0 — the top priority. Does the decision help the P0, hurt it, or have no relevance to it? If it is irrelevant, the exact answer matters less, so decide and move. If it distracts from or harms the P0, the answer is no. Third, be realistic about who will actually do the work.

The failed acquisition that forced Harvey to build the hard way

One of Harvey’s darkest moments came from an attempted acquisition in early 2024. The company was about a year and a half old. Weinberg said they believed there might be a way to “one-shot” the company into being instead of building it the hard way. The target was about 10 times larger than Harvey by headcount, and Harvey tried to buy it at a valuation roughly comparable to, or slightly higher than, Harvey’s own.

They signed the deal before having the money, in the style Weinberg associated with older private-equity tactics: lock up the target, then raise the financing. Harvey was trying to raise around $700 million. It secured roughly $500 million in clean equity, but the deadline arrived while the company was still short. There was an option to take payment-in-kind debt, which Weinberg described as dangerous because failure to meet the debt obligation could allow someone else to own the company. They decided not to take it.

When they pulled out, Weinberg thought Harvey might be over. He worried the company could not build fast enough, that model providers would eat the market, and that without the acquired company Harvey was “screwed.”

The recovery took about 24 hours. He said he has had many moments like that: a massive failure, a day of thinking it is all over, then waking up the next morning and seeing the path again. He connected it to earlier personal failures — failing out of his first high school, underperforming on his first LSAT attempt — moments that felt terminal and were not. Repetition built tolerance.

The failed acquisition forced Harvey to do the work the acquisition had been meant to shortcut: hire the right people, improve product, create scaling processes, and build the company itself. Weinberg now describes that history as part of the reason he is more confident in Harvey, even though external threats are much greater than a year earlier. The team has failed together. It has taken large risks, butchered product lines and recovered them, made wrong hires, and considered a deal that might have collapsed the company. That shared record gives him confidence in the team’s resilience and adaptability.

The same logic underpins what Parrish called “stress maxing.” Weinberg does not mean maximizing stress without limit. He means facing stressful categories of decisions early enough that the stakes are survivable and the repetition reduces future stress. Firing someone in a 10-person company is hard, but avoiding the skill until the company is large and the person is an executive can create a much more dangerous problem.

Weinberg also links stress to decision speed. He makes decisions “really, really fast” for two reasons. First, he maintains current context on the company. Because he has taken little time off and has been “always on,” he experiences the business as a daily delta rather than a fresh download. Second, he has done every part of the company at some point before hiring someone to take it over. That gives him a strong pulse on what is happening.

His regrets, he said, are rarely about making the wrong decision. They are about waiting too long to decide. The metaphor he used is sitting at the bottom of the stairs instead of jumping up one step, figuring it out, and jumping again.

The danger is thrash. At 50 people, if the CEO gets stressed, the company can absorb and interpret it because people know one another. At 800 people, Weinberg said, the CEO’s stress can move the whole company. That creates a boy-who-cried-wolf dynamic in which employees cannot tell what actually matters. The next scaling challenge, in his view, is turning his own anxiety-driven operating system into shared company principles: the concerns are clear, and the company knows how to approach them without relying on his stress to direct motion.

Sleep, exercise, and prioritization are his main tools for managing that stress. A morning run gives him a “shock” that makes the rest of the day calmer. Prioritization reduces stress by shrinking the field of problems. Instead of feeling that a thousand things are wrong, he tries to identify the one thing that is terminal and focus on incrementally improving it.

The people who break are not the ones who make too many mistakes

Weinberg’s hiring filter is built around urgency, resilience, and comfort with reversible decisions. For executives, he looks for people who believe what he believes: the next one or two years may define which companies succeed for the next decade or more. That belief changes how decisions feel. The company cannot behave as though every choice is permanent.

He is wary of people who treat too many things as one-way doors. In his view, many high-achieving candidates have never failed in meaningful ways. They have prestigious backgrounds and may never have gotten a B. That can make every mistake feel existential. Startups, especially in AI, move too fast and face too many losses for that posture to survive.

Weinberg said no one at Harvey has been let go because they made too many mistakes. The failures that lead people to break are different: decision paralysis, inability to scale, failure to hire a strong team, fear of hiring people better than themselves. Those failures come from being unable to tolerate the risk and discomfort required by the work.

This also shapes how he interviews. He spends time on life history, interests, mistakes, and what the candidate wants to do. He also likes live asynchronous work in a Google document. He will create a document with questions or a project and work through it with the candidate. Partly, this tests whether they can work in the async style he prefers. It also reveals hedging. If a question is basically an option A or option B decision and the candidate writes 20 paragraphs, Weinberg reads that as a signal: if they hedge this much here, what will they do under pressure?

His broader criticism is that people often try to plan every stair before climbing. Worse, they may start climbing, gain new information from step three, and still refuse to change because the original plan said to continue to step four, five, and six. To Weinberg, that is fear of being wrong masquerading as discipline.

“Building a company is a thousand failures and then a couple successes,” he said. The point is not to avoid failed bets. It is to make many bets, adapt as information changes, and develop tolerance for being wrong at small scales before the stakes become larger.

Urgency, in his view, cannot be maintained by a founder poking every part of the company. That only creates thrash. The scalable version is hiring and culture: hire leaders who feel the urgency, promote people who transmit it, and let those leaders instill it in their own teams. The founder’s job becomes building a culture and hiring process that selects for urgency rather than substituting personal pressure for organizational design.

The founder should come back to product, people, and flexible vision

When asked to distill what he has learned about running a company, Weinberg gave three principles.

First, once the company has some distribution — once some customers know who it is — founders should spend as much time as possible on product. The biggest mistakes he has made as an executive, he said, came when he stepped away from product and tried to compensate by doing sales. That can help for a few quarters and feel gratifying, but product is “the only thing that scales.”

Second, the company must constantly ask whether the right people are in the right positions. Weinberg does not define this only as hiring and firing. It includes whether a person fits the role, whether they are being mentored well, whether they are mentoring their own team, and whether swim lanes are clear. He does not think a CEO can spend enough time on that problem.

Third, vision-setting has to remain flexible. This is the principle he thinks he has least mastered. He described writing a long product vision document in the third quarter of the prior year, laying out a multi-year view. The mistake was that people interpreted it as something to build immediately. Product timings and launches slipped because he did not communicate the level of abstraction properly.

The unresolved management problem is how to distinguish between high-level vision and near-term execution. When should the CEO describe the five-year direction, and when should he say what must happen today or tomorrow? Weinberg said he still struggles with that.

He has also changed his view on senior executives. Early advisers told him to hire them very soon. He refused, and he thinks he was right to refuse at first. Then he refused about six months too long. He knew he was late only after seeing the impact of the executives once hired. His explanation returns to founder context: a founder who has been present since day one only has to update the company story day by day. A newcomer has to absorb years of context at once. That makes founders unusually good at sensing when something should have changed earlier, even if they cannot prove it in advance.

What he thinks Harvey has gotten right is not a single product or financing decision, but the construction of a resilient team. He said he is proud of the team, while acknowledging that he has high standards and can be harsh. He believes the company has made choices that favor long-term survival over short-term maximization: investing in post-sales, choosing customers carefully, thinking about brand, and structuring the team for durability. Harvey may have sacrificed some short-term ground to competitors because of those choices, but Weinberg would rather build a company without a ceiling than maximize the next three years and then be trapped.

His definition of success follows the same pattern. He wants Harvey to feel that it left everything on the table over the next few years. He credited “Harley” for the idea that the company has to re-earn its position every six months. In AI, he said, the bar doubles or triples in height each time. Success would be knowing that every six months, the team did everything it could to clear that higher bar.

For Weinberg personally, the company has become inseparable from that definition. He said he did not really find his way for 27 years before Harvey and had periods of serious mental health difficulty. Building Harvey is painful, but it is also the most fun he has had, and “not close.” He described life before and after the company as two different lives, and said he does not want to go back.

The frontier, in your inbox tomorrow at 08:00.

Sign up free. Pick the industry Briefs you want. Tomorrow morning, they land. No credit card.

Sign up free