Eric Nguyen, a PhD candidate at Stanford University, joins us today. We discuss his research on long-context foundation models and their application to biology, particularly Hyena and its evolution into the Hyena DNA and Evo models. We talk about Hyena, a convolution-based language model developed to address the challenges that long context lengths pose in language modeling. We examine the limitations of transformers on longer sequences, the motivation for using convolutional models instead, the model's architecture and training, the role of the FFT in computational optimizations, and model explainability in long-sequence convolutions. We also discuss Hyena DNA, a genomic foundation model pre-trained with context lengths of up to one million tokens and designed to capture long-range dependencies in DNA sequences. Lastly, Eric introduces Evo, a seven-billion-parameter hybrid model that integrates attention layers with Hyena DNA's convolutional architecture. We cover generating and designing DNA with language models, as well as the trade-offs between state-of-the-art models, zero-shot versus few-shot performance, evaluation benchmarks, and the exciting potential in areas like CRISPR-Cas gene editing.
Extended context language models and their biological uses
