AI? ???? ???? ??? ??? ??? ????? ??????. ? ?? ????, ??? ??? ??? ??, ???? ??? ?????? ???? ?? ????? ?? ???? ???? ???? ??? ??? ??? ??, ??? ?? ?? ?? ?? ??? ??? ? ?? ????, ???? ???? AI ?? ??? ?? ???? ??????. ??? ????? ?? ???? ?? ??? ??? ??? AI ????? ??? ????? ??? ???? ??? ??? ??? ? ? ????.
??? ??? ??? ??? ??, RNA ? ???? ????, ??? ??? ?? ?? ??? ?? ?? ???? ??? ??? ???? ??, ??, ??? ???? ???? ?? ???? ??? ??? ???? ???. ???? ??? ??? ???? ? ???? ?? ??, ???? ?? ??, ?? ???? ??? ?? ???? ?? ??? ??? ? ????.
AI ?? ??? ??? ??? ?? ??? ???? ???, ??? ????? ??? ??? ???? ?? ??? ??? ???? ??? ?? ???? ??? ?? ??? ?? ?????.
- /NVIDIA/bionemo-examples ?? ???? ???? NVIDIA BioNeMo NIM ????????? Evo 2? ??? ??????.
- ??? ??? ?? ??? ?? ?????? ?????.
- BioNeMo ??????? ?? ?? ???? ???? Evo 2 ??? ??? ???.
- NVIDIA BioNeMo ???? ?? ????? ?????.
???? ??? ???? ??? ??? ? ??? ??
2024? 11?? ? Evo ??? ??? ???? ???? ???? ????, DNA, RNA, ??? ??? ?? ???? ???? ???? ??? ? ?? ????? ??? ??????.
???? ??? ?? ???? ?? ????? ????? ??? ??? Evo? ?? ???? ???? ??? ??? ???? ???? ??? ?? ??? ???? ???? ??? ????. 3?? ?? ?????? ??? ???? 270? ?? ???? ? ?? ???? ?? ??? Evo? ?? ???? ?? ? ?? ???? ?? ?????? ???? ??????.
Evo? ??? ??? ???? StripedHyena ???????(?? 1). StripedHyena? 29?? Hyena ???? ???? ????? ????, Transformer ????? ???? ?? ??? ????? ???? ?? ? ???? ??? ????? ??? ??? ??? ? ?? ???????. ?? ???? ??? ???? ??? ?????.
? ?? ??? Evo? ?? Transformer ??? ??? ????, ?? 131,072?? ???? ??? ? ????? ????? ??? ? ?? ?????. ? ??, ?? ??? ??? ?? ??? ? ??? ??? ??? ??? ? ?? ??? ???? ?? ???? ?? ???? ?? ??? ??????.

Evo? ?? ??? ??? ???? ??? ??? ?????. EVO? ???, ???? RNA, ?? DNA? ?? ????? ??? ?? ??? ???? ?? ??? ???? ??? ?? ??? ?????, ?? ??? ? ?? ?? ??? ?? ??? ????? ??????.
Evo? ?? ???? CRISPR-Cas ???? ?????? ???? ??? ??? ??? ??????. ??? ??? ??? ?? ??????, Evo? ?? ???? ??? ? ?? ??? ??? ???? ???? ??? ? ??? ?????.
Evo? ?? ?? ? ?? ??? ???? ??? ?? ??? ???? ? ?? ??? ?? ??? ??????. ??? ??? ??? ???? ?? ???? ?? ??? ???? ??? ? ?? ??? ?? ?? ? ?? ?? ??? ??? ?????.
Evo? ???? ??? ???? ??? ? ???? ????. ? ?? ?????? ???? ??? ???? ??? AI? ???? ???? ??? ??? ??? ???????.
?? ???? ??? ?? ??
Evo 2? ??? ????? ??? ?? ??? ?? ??? ????, Evo? ??? ???? ??? ???, ??? ????, ??? ??? ?????.
Evo 2? DNA, RNA, ???? ? ?? ?? ?? ??? Eukarya, Prokarya, Archaea? ? ?? ?? ??? ?? ????? ??? ? ????. 15,032?? ???? ???? 113,379?? ???? ????? ??? 8.85T ??????? ??? ??? ??? ? ?? ??? ??? ??? ?? ????, ?? ?? ?? ? ???? ???? ??, ???? ????? ??? ?? Evo? ?? ?? ??? ?? ?????.
Evo 2? ?? 400? ?? ????? ??? ??? ??? StripedHyena 2 ????? ????, ??? ?? ???? 1?? ?? ???? ??? ??? ???? ???? ??? ?????. StripedHyena 2? ????? ???? ?? ?? ????? ?? ???, Transformer? ?? ?? ????? ?? ?? ??? ???? ?? ????? ???? ?? ??? ?????.
?? ? Evo 2 ??? AWS?? NVIDIA DGX Cloud? ???? 2,048?? NVIDIA H100 GPU? ???????. NVIDIA? Arc? ????? ?? NVIDIA AI ?????? ?? ???? ??? ?? ??? ???? ???? ? ?? ??? AI ???? ??? ? ?????.
??? ??? Evo? 70? ? ????? 131,000 ?? ???? ??? ?? ???? ?? ??? ?????, Evo 2? ?? ?? ? ?? ?? ??? ???? ???? ????????(? 1).
?? | Evo | Evo 2 |
??? ?? ??? | ???? + ?????? (300B ??????) | ?? ?? ?? + ?????? (9T ??????) |
?? ???? | 7B | 7B + 40B |
???? ?? | 131,072? ?? | ?? 1,048,576? ?? |
?? | DNA, RNA, ??? | DNA, RNA, ??? |
?? | ???? ???? ?? | ???? ???? ?? |
?? | ??? ?? ? ?? | ???? ?? ? ?? |
Evo 2? ??? ?? ???? ??? ????? ??? ??? ?? ???? ??? ??? ??? ? ??? ?????. ? ?? ?? ??? DNA, RNA, ??? ???? ?????, ???? ?? ??, ??? ??? ?? ???? ??? ??? ???? ???. Evo 2? ?? ???? ???? ???? Evo? ????? ????, ?? ??, ??, ?? ??? ?? ? ?? ????? ???? ???.
Evo 2? ?? ??? ??? ?? ???? ??? ??? ?????.
- ?? ?? ??: ?? ? ???? ??? ???? ??? ?? ?? ?? ????? ??? ??? ?? ?? ?? ???? ? ??? ???? ?????.
- ??? ???: ???? ? ???? ????? ?? ???? ????, ?? ??? ??? ?? ????, ?? ??? ? ??? ??? ?? ?? ??? ?????.
- ??? ??: ??? ??? ???? ? ???? ??? ?? ??? ??? ???? ????, ??? ???? ?? ??? ??? ?? ?? ?? ??? ??? ??? ?? ??? ??? ?????.
NVIDIA Evo 2 NIM ??????? ??
NVIDIA Evo 2 NIM ???????? ???, ???, ?? ????? ??? ? ?? ??? ??? API? ?? ??? ??? ??? ???? ? ?????.
# Define JSON example human L1 retrotransposable element sequence
example = {
# nucleotide sequence to be analyzed
"sequence": "GAATAGGAACAGCTCCGGTCTACAGCTCCCAGCGTGAGCGACGCAGAAGACGGTGATTTCTGCATTTCCATCTGAGGTACCGGGTTCATCTCACTAGGGAGTGCCAGACAGTGGGCGCAGGCCAGTGTGTGTGCGCACCGTGCGCGAGCCGAAGCAGGGCGAGGCATTGCCTCACCTGGGAAGCGCAAGGGGTCAGGGAGTTCCCTTTCCGAGTCAAAGAAAGGGGTGATGGACGCACCTGGAAAATCGGGTCACTCCCACCCGAATATTGCGCTTTTCAGACCGGCTTAAGAAACGGCGCACCACGAGACTATATCCCACACCTGGCTCAGAGGGTCCTACGCCCACGGAATC",
"num_tokens": 102, # number of tokens to generate
"top_k": 4, # only predict top 4 most likely outcomes per token
"top_p": 1.0, # include 100% cumulative prob results in sampling
"temperature": 0.7, # add variability (creativity) to predictions
"
": True, # enable more diverse outputs
"enable_logits": False, # disable raw model output (logits)
}
# Retrieve the API key from the environment
key = os.getenv("NVCF_RUN_KEY")
# Send the example sequence and parameters to the Evo 2 API
r = requests.post(
# Example URL for the Evo 2 model API.
url=os.getenv("URL","https://health.api.nvidia.com/v1/biology/arc/evo2-40b/generate")
# Authorization headers to authenticate with the API
headers={"Authorization": f"Bearer {key}"},
# The data payload (sequence and parameters) sent as JSON
json=example,
)
??? ????? API ??? ?? ??? ??? NVIDIA BioNeMo ????? ??? ?????.
Evo 2? ?? ?? NVIDIA BioNeMo ?????? ???? ?? ??? ?? ????. ?? Evo 2? ?? ?? ??? ??? BioPharma? ?? ??? ?? ???? ?? ??? ??? ?????.
# Prepare raw sequence data for training based on a YAML config file
preprocess_evo2 -c data_preproc_config.yaml
# Trains the Evo 2 model with preprocessed data and parallelism across multiple GPUs
torchrun --nproc-per-node=8 --no-python train_Evo 2 -d data_train_config.yaml --num-nodes=1 --devices=8 --max-steps=100 --val-check-interval=25 --experiment-dir=/workspace/bionemo2/model/checkpoints/example --seq-length=8192 --tensor-parallel-size=4 --pipeline-model-parallel-size=1 --context-parallel-size=2 --sequence-parallel --global-batch-size=8 --micro-batch-size=1 --model-size=7b --fp8 --tflops-callback
# Optional Fine-tuning: Add this argument to start from a pretrained model
# --ckpt-dir=/path/to/pretrained_checkpoint
Evo 2? ??? ???? AI? ??
AI? ??? ??? ??? ???? ??? ?? ???, ???? ?? ?? ?? ??? ??? ??? ???? ???. Evo 2? ?? ?? ???? DNA, RNA, ??? ??? ???? ??? ? ?? ??? ????? ??? ????, ??? ??? ??? ?????.
Evo? ????? ???? ??? ??? ??? ???? ? ?????, Evo 2? ??? ?? ? ????? ??? ?? ? ???? ??, ???? ???, ?? ??, ??? ??? ???? ? ??? ?????.
? ??? ???? ??? ???? ??? ?? ?? ??? ??? ????? ??? ??? ??? ? ?? Evo 2? ??? ? ?????? ?????. ? ???? ???? ???? ??? ???? ??? ??? ? ???, ?? ??, ??, ?? ??? ??? ??? ??? ?? ?? ? ?? ?? ????? ??? ? ????.
? ??? ???? ??, Evo 2? ?? ??? AI? ?? ??? ???? ????, ??? ??? ??? ???? ???? ? ???? ??? ?? ??? ???? ????. ??? ??? AI ?? ??? ???? ???? ????, ?? ??? ??? ??? ?? ???? ???? ????? ??? ???? ????. Evo 2? ??? AI? ?? ??? ??, ??, ????? ? ???? ? ? ???? ? ??? ?????.
Evo 2? ?? ??? ??? Arc Institute?? ??? ?? ???? ?????. Evo 2? NVIDIA BioNeMo ??? ???? ??? ? ????.
?? ?? ??
? ???? ????, ???, ?? ???? ??? ??? ? ??, ??? ??? ?? ???? ??????.
- Garyk Brixi, ???? ??? ??? ?? ??
- Jerome Ku, Arc Institute? ???? ?? ?? ????
- Michael Poli, Liquid AI ?? ??? ? ???? ??? ????? ?? ??
- Greg Brockman, OpenAI? ?? ??? ? ??
- Eric Nguyen, ???? ??? ???? ?? ??
- Brandon Yang, Cartesia AI ?? ??? ? ???? ??? ????? ?? ?? (?? ?)
- Dave Burke, Arc Institute ?? ?? ???
- Hani Goodarzi, Arc Institute? ?? ???, ????? ?? ??????(UCSF) ????? ? ??? ???
- Patrick Hsu, Arc Institute ?? ???, ????? ?? ???(UC Berkeley) ????? ??? ? Deb Faculty Fellow
- Brian Hie, ???? ??? ????? ???, Dieter Schwarz Foundation Stanford Data Science(SDS) Faculty Fellow, Arc Institute ?? ???, ???? Laboratory of Evolutionary Design ??
?? ???
- GTC ??: ???? ??: ???? ?? ??? AI ??? ??
- GTC ??: 3D BiCycle GAN? LLM? ??: ?? ?? ?? ?? ??? ??
- GTC ??: Amgen? ?? ??? ??? ?? AWS ? Amazon EC2 ?????? NVIDIA DGX Cloud? ???? ??(Amazon Web Services ??)
- NGC ????: AlphaFold2
- NGC ????: AlphaFold2-Multimer
- ???: ?? ?? ??