Kaushal
kd-tensor
·
AI & ML interests
model inferencing, synthetic data generation, model fine-tuning
Organizations
None yet
safety-alignment
Papers to read
-
Jamba: A Hybrid Transformer-Mamba Language Model
Paper • 2403.19887 • Published • 111 -
The Unreasonable Ineffectiveness of the Deeper Layers
Paper • 2403.17887 • Published • 82 -
Tuning Language Models by Proxy
Paper • 2401.08565 • Published • 22 -
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning
Paper • 2402.04833 • Published • 5
RAG
safety-alignment
toread
Papers to read
-
Jamba: A Hybrid Transformer-Mamba Language Model
Paper • 2403.19887 • Published • 111 -
The Unreasonable Ineffectiveness of the Deeper Layers
Paper • 2403.17887 • Published • 82 -
Tuning Language Models by Proxy
Paper • 2401.08565 • Published • 22 -
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning
Paper • 2402.04833 • Published • 5
Synthetic Data Generation
General collection for making stuff up! :)