undefined

points

[-]

All of what you said makes sense from the perspective of a product manager working for a for-profit company trying to maximize profit either today or eventually.

But the submission blog post writes:

> To advance scientific research, we’re making AlphaGenome available in preview via our AlphaGenome API for non-commercial research, and planning to release the model in the future. We believe AlphaGenome can be a valuable resource for the scientific community, helping scientists better understand genome function, disease biology, and ultimately, drive new biological discoveries and the development of new treatments.

And at that point, they're painting this release as something they did in order to "advance scientific research" and because they believe "AlphaGenome can be a valuable resource".

So now they're at a cross-point, is this release actually for advancing scientific research and if so, why aren't they doing it in a way so it actually maximizes advancing scientific research, which I think is the point parent's comment.

Even the most basic principle for doing research, being able to reproduce something, goes out the window when you put it behind an API, so personally I doubt their ultimate goal here is to serve the scientific community.

Edit: Reading further comments it seems like they've at least claimed they want to do a model+weights release of this though (from the paper: "The model source code and weights will also be provided upon final publication.") so remains to be seen if they'll go through with it or not.

by kuboble225 days ago|

parent|

[-]

But to add some historical context.

Similarly with alpha Go they claimed to do it "to advance go" and help go community, but they played Lee se dol, released few curated self play games, collected publicity and abandoned go with no artifacts like source or weights.

But in hindsight their paper turned out to be almost 100% reproducible and resulted in super-human open-source alternative less than a year later.

So the story might repeat here. And they will achieve started goal without releasing anything

by wrsh07225 days ago|

parent|

prev|

[-]

To be clear: I agree that opening up model + weights makes it possible for third parties to distill or fine tune

If you look at the frenzy of activity that happened after midjourney became accessible, that was awesome for everyone. Midjourney probably got help running their model efficiently and a ton of progress was quickly made.

I'm pretty sympathetic to a company doing a windowing strategy: prepare the API as a sort of beta release timed with the announcement. Spend some time cleaning up the code for public release (at Google this means ripping out internal dependencies that aren't open source), and then release a reference inference implementation along with the weights.

That's pretty reasonable. I wanted to push back on this idea that "the reason Google isn't dropping model + weights is because the corporate screws are coming down hard"

Google isn't waiting to release the weights so that they can profit from this. It's essentially the first step in the process, and serving via API gives them valuable usage data they they might not get if/when it's open sourced

by jryb225 days ago|

parent|

[-]

I take most of your points except the last one. The feedback would come in the form of publications, definitely from academia and to a lesser degree industry (admittedly a slow iteration time). Also just public discourse - there was no dearth of very specific, highly technical feedback for any of the releases of alphafold on twitter, for example.

But I can’t use this at all at work (a pharma company) because it would leak confidential information. So anything they learn from usage data is systematically excluding (the vast majority of?) people working on therapeutics.

by wrsh07225 days ago|

parent|

[-]

If it's worth using for you in your work, it might be worth your employer striking a deal about data confidentiality so you can use it.

But you couldn't use it for work anyway because usage is non commercial. So you need to pay them to change the license anyway.

by diggan225 days ago|

parent|

prev|

[-]

> serving via API gives them valuable usage data

It might give them a bit, but AFAIK most institutions (especially non-American ones) aren't exactly overly happy about using closed American APIs in order to do science, especially not because API usage isn't reproducible.

Sure, they might be able to play around with some toy data, but for Google to actually get valuable usage data, then they need to let people actually use the thing for real things, and then you cannot gate it behind a API, it isn't feasible in a real-world environment.

by pfisherman224 days ago|

parent|

prev|

[-]

Key question is the license they attach to model and weights. I have been seeing an increasing amount of releases in this space under non-commercial licenses.

I think companies in the space should either totally open source or not publish at all.

I can see publishing like this as achieving one (or more) of a several objectives:

1. Marketing software to for sales / licensing

2. Marketing startup to investors

3. Crowdsourcing use cases or product features from academia

Now here are the problems with those:

1. Selling software (exclusively) to drug companies is a terrible business model. Very low ceiling there. You can make more from one drug.

2. Indicates company focus is producing models and not drugs. See point one.

3. Computational labs want to release open source, so not viable to build on restricted tooling. Experimental labs may just be using to algo-wash prior hypotheses / biases.

Now weigh against disadvantage of letting competitors know what you are working on, how far you have progressed, as well as your methods.

by MattRix225 days ago|

parent|

prev|

[-]

I feel like this take is missing a sense of balance. You can have a goal of advancing scientific research while also still making money. You don’t have to choose one extreme end of the scale.

I’d argue that the product providing some monetary value for Google will help ensure that this team doesn’t get moved some more profitable project instead. That way they can continue improving this tool and make more tools like it in the future.

by mensetmanusman224 days ago|

parent|

[-]

Working in research and development for 20 years, I can assure you that the only science that leaves the lab is that which can make money.

by LarsDu88225 days ago|

prev|

[-]

The predecessor to this model Enformer, which was developed in collaboration with Calico had a weight release and a source release.

The precedent I'm going with is specifically in the gene regulatory realm.

Furthermore, a weight release would allow others to finetune the model on different datasets and/or organisms.

by Onawa225 days ago|

prev|

[-]

I think that from a research/academic view of the landscape, building off a mutable API is much less preferred than building of a set of open weights. It would be even better if we had the training data, along with all code and open weights. However, I would take open weights over almost anything else in the current landscape.

by wrsh07225 days ago|

parent|

[-]

If it came to light that somebody found a way to use this API in a way that is harmful to society would you be happy that Google could revoke access? Or unhappy?

This is a real tradeoff of freedom vs _. I agree that I'm not always a fan of Google being the one in control, but I'm much happier that they are even releasing an API. That's not something they did for go! (Of course there was a book written so someone got access)

by roughly225 days ago|

parent|

[-]

If it came to light that somebody found a way to use this API in a way that is beneficial to society, would you be happy that Google could revoke access? Or unhappy?

by wrsh07220 days ago|

parent|

[-]

This is a bit of a false dichotomy.

If it's useful, it's good that it was created. We should be happy about the progress. If it's harmful, we should be happy about guardrails.

Friction is a useful tool, I don't begrudge people or companies from employing it

by nimchimpsky225 days ago|

parent|

prev|

[-]

[dead]