view article Article TimeScope: How Long Can Your Video Large Multimodal Model Go? +2 Jul 23, 2025 • 47
view article Article How to generate text: using different decoding methods for language generation with Transformers Mar 1, 2020 • 283
view post Post 1828 I was curious about the Block Diffusion hybrid model and tried retraining it on a DNA tokenizer + dataset 🧬. Too early to evaluate, but it generates sequences (AAATGG TTATTG CAAATC...) and was improving on the validation set during trainingModel: monsoon-nlp/dna-blockdiff-papayaOriginal paper: Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models (2503.09573) See translation 🔥 8 8 + Reply