More

fnbr · 2025-11-21T16:52:25 1763743945

(I'm a researcher on Olmo.)

There's a bunch of other fully open models, including the [Marin](https://marin.community/) series of models out of Stanford and Nvidia regularly releases fully open models.

fnbr · 2025-11-21T15:06:25 1763737585

(I’m a researcher on the post-training team at Ai2.)

Where did you try this? On the Ai2 playground?

tcsenpai · 2025-11-22T09:07:49 1763802469

Hello! On Open WebUI using ollama as a backend :)

I guess Ollama needs to update their version, maybe!

fnbr · 2025-11-21T15:05:33 1763737533

(I’m a researcher on the post-training team at Ai2.)

7B models are mostly useful for local use on consumer GPUs. 32B could be used for a lot of applications. There’s a lot of companies using fine tuned Qwen 3 models that might want to switch to Olmo now that we have released a 32B base model.

littlestymaar · 2025-11-21T15:46:21 1763739981

May I ask why you went for a 7B and a 32B dense models instead of a small MoE like Qwen3-30B-A3B or gpt-oss-20b given how successful these MoE experiments were?

fnbr · 2025-11-21T16:32:18 1763742738

MoEs have a lot of technical complexity and aren't well supported in the open source world. We plan to release a MoE soon(ish).

I do think that MoEs are clearly the future. I think we will release more MoEs moving forward once we have the tech in place to do so efficiently. For all use cases except local usage, I think that MoEs are clearly superior to dense models.

trebligdivad · 2025-11-22T00:48:23 1763772503

Even local, MoE are just so much faster, and they let you pick a large/less quantized model and still get a useful speed.

riazrizvi · 2025-11-21T16:37:37 1763743057

7B runs on my Intel Macbook Pro - there is a broad practical application served here for developers who need to figure out a project on their own hardware, which improves time/cost/effort economy. Before committing to a bigger model for the same project.

kurthr · 2025-11-22T00:53:25 1763772805

Are there quantized (eg 4bit) models available yet? I assume the training was done in BF16, but it seems like most inference models are distributed in BF8 until they're quantized.

edit ahh I see it on huggingface: https://huggingface.co/mlx-community/Olmo-3-1125-32B-4bit

fnbr · 2025-04-13T22:52:04 1744584724

This is why I will never work somewhere with a short post termination exercise period (PTEP). If it’s not at least 5 years, ideally 10, they don’t seriously consider equity something that employees are owed.

fnbr · 2025-04-13T22:49:30 1744584570

Can you explain? In most cases, preferences won’t come into play, assuming you raise at a standard 1x preference and sell for more than you have raised. In that case, owning 0.5% should roughly translate into $5M (modulo dilution).

wrs · 2025-04-13T23:11:21 1744585881

There are plenty of valid scenarios where the company sells for a lot, but less than it raised. And 1x preferences are no longer standard post-ZIRP, afaik.

People are often not aware that the value of common is nonlinear, so the value of 0.5% in this case is zero. (For the ML fans out there, the common price per share has one or more ReLU activation layers. :) )

est31 · 2025-04-13T23:35:42 1744587342

Even with 1x preferences, the company might have raised $2 billion but sells for $1 billion because the investors don't want to get any further losses.

The general rule of thumb is that acquisitions are bad for employees, and IPOs are good, especially if the share price is stable for 6 months.

jaredsohn · 2025-04-14T00:31:15 1744590675

Also for acquisitions, often you'll have to work at the acquiring company for some time to get money from your options. Or might get options in the acquiring company instead (which again are worth nothing until some future possible equity event which hopefully translates into cash).

pc86 · 2025-04-14T12:48:43 1744634923

Have 1x preferences become standard? When I worked in startups early investors often has 2x or 3x liquidation preferences, especially at seed.

immibis · 2025-04-13T23:11:11 1744585871

That would be the naive mathematical interpretation and how the system would work if engineers designed it. Lawyers designed it, though, and they probably know some tricks to make that not happen.

guappa · 2025-04-14T09:45:48 1744623948

You think engineers never scam?

Der_Einzige · 2025-04-14T10:31:23 1744626683

Not like lawyers, dentists, car salesmen, etc do!

fnbr · 2025-04-14T01:30:36 1744594236

Like what? All the examples people have said are where either

1) the company has Nx preferences, for N >1, in which case the company has essentially failed to fundraise or

2) the company sells for less than they raised, which again, is a polite form of failure.

cyanydeez · 2025-04-14T01:40:12 1744594812

lets no degrade lawyers more than necessary.

Business people hired lawyers to design means and methods to commit _implicit_ fraud and deceptive practices to improve the value of their capital assets.

Those lawyers then go on to sell this product to others.

I'm sure there's some lawyers out there that are going out there shopping this stuff around, but it's Capitalism and Business thats the active agent, not Lawyers.

pdntspa · 2025-04-14T15:35:19 1744644919

I am under the impression that an oversized cap table is pretty much standard. Am I wrong?

fnbr · on Aug 12, 2024

I hate how every company that I place an order with treats that as permission to send a constant drip of marketing emails. I send them straight to spam.

mtmail · on Aug 12, 2024

The article is not about emails, it’s about cancelling paid subscriptions.

fnbr · on Aug 2, 2024

They raised ~$200M total, and $150M at the series A. So if they're just paying back investors, they'd "only" need $500M.

fnbr · on Aug 2, 2024

do we know they were profitable? I doubt it, if they pulled this. I think they had high DAU/MAU but low paid users.

this is basically Inflection 2.0.

jsheard · on Aug 2, 2024

All public information about their finances only mentions their revenue, not their profits, which means they're almost certainly not profitable.

beoberha · on Aug 2, 2024

I would be absolutely floored if any of these new crop of AI companies set profitable

fnbr · on July 23, 2024

Yes, many AI startups pay more than FAANG for ML talent.

neilv · on July 23, 2024

I think FAANG-like big IC offers right now are flying around for technical people familiar with the current hot "AI" methods. A lot of hype, a lot of situational ethics around copyright, and some investment scams, but some of the tech is ready for legitimate things people want to do, at acceptable quality levels for the application.

Also, one thing that happened early in the dotcom gold rush is that a ton of people swarmed in, all suddenly acting like experts and professionals, with little/no prior experience. Meanwhile, Internet people who were also prolific programmers were, like, who are all these people, and why do many of them have fashionable eyeglasses like nerds would never try to pull off. I don't know how much we'll see something like that this time.

ai4ever · on July 23, 2024

agree 100% with this nice comment from a fellow dot-com crash observer.

Many HackerNews readers are young'uns who are seduced by FANG salaries, and the latest AI/ML bandwagon. They dont have the perspective one gets having seeing the highs and the lows.

I can bet there will be another rude awakening around the corner which will wipe out the "pretenders". After the dot-com bust, many pretenders left the silicon valley with anecdotes of leased BMWs abandoned at SFO airport.

After the AI/ML hype deflates, there is likely to be a similar separation. Folks in it for real will be separated from the folks who came in for the riches.

And for the history-buffs this rhymes with the CA gold-rush circa late 1800s - some found it, most didnt, levis profited selling denims to them.

callalex · on July 23, 2024

But will that still be true in 6-12 months when the bubble pops?

fnbr · on May 30, 2024

What's the dating app?