vishal

@vishal_learner - 47 videos

121 subscribers

Machine Learning.

Recent videos

Understanding ColBERT's ivf.pid.pt: Inspecting Intermediate Artifacts from _build_ivf & optimize_ivf

Do RAGatouille and ColBERT Produce the Same Index and Retrieval Scores? A Deep Dive Comparison

TIL: Understanding LLM Foundry's BinPackCollator (Sequence Packing for 95% Token Efficiency!)

Improving LLM Judge Alignment: Enhancing TinyScale Lab Evaluation Agreement to 94%

Technical Report Summary: Nomic Embed

Understanding Sequence Packing: Initial Musings

Building an LLM Judge Agreement App: 7 Iterations from Basic to Full Functionality

Evaluating First Attempt LLM Judge Scores: Improving Claude Haiku Alignment for Story Scoring

Manual Scoring Results for TinyStories Models: Grammar, Reasoning, and Emergent Capabilities

Look at Your Data: Building an LM Scoring App with FastHTML

TSL: Curating Evaluation Prompts, Defining Scoring Criteria + Designing LLM Judge Prompt Template

TinyScale Lab Update: Setting Eval Targets + Generation Completions for LLM Judge Development

TinyScaleLab Project Update: Training Cost Analysis and Evaluation Infrastructure Plans

TinyScale Lab: Exploring the Connection Between Training Dynamics and Model Capabilities

Research Paper Summary: Small-scale proxies for large-scale Transformer training instabilities

My Second-Place Winning Tiny Model Hackathon Journey: Pre-Training from Scratch

LossInspector: A Deep Dive Into LLM-Foundry's Next-Token Prediction with a Custom Composer Callback

The Evolution of Matrix Multiplication: 12,000x Numba Speedup 🚀 | fastai Course Lesson 11

Research Paper Summary: TinyStories

Look at Your Data: Manual Validation of Retrieval Metrics

Creating a Custom Composer Callback to Track Data Types in LLM Training | Mixed Precision Deep Dive

Paper Reading: Small-scale proxies for large-scale Transformer training instabilities

Paper Reading: Overtrained Language Models Are Harder to Fine-Tune

Paper Reading: SmolLM2

Exploring Sequential and Merged Linear Layer Forward Passes

TIL: Using PyTorch's register_forward_hook to Trace Floating Point Errors

Debugging Un-Merged and Merged LoRA Model Output Differences

LoraModel.merge_and_unload Deep Dive

RAGatouille/ColBERT Indexing Deep Dive

Recreating Plots from Appendix A.4 of the DoRA Paper for LoRA Learns Less and Forgets Less Models

TIL: PeftModel Base Model Behavior

Research Paper Summary: Hypencoder: Hypernetworks for Information Retrieval

Code Walkthrough - peft DoRA Implementation

Research Paper Summary: rsLoRA

Research Paper Summary: LoRA Learns Less and Forgets Less

Recreating the PLAID ColBERTv2 Scoring Pipeline: From Research Code to RAGatouille

fastbook-benchmark: ColBERT Search

fastbook-benchmark: Single Vector Search

fastbook-benchmark: Scoring Retrieval Results

fastbook-benchmark: Full Text Search Implementation

fastbook-benchmark: Document Processing

Introducing the fastbook-benchmark Information Retrieval QA Dataset

Implementing Image to Image Generation in Stable Diffusion from Scratch | fastai Part 2

Implementing Negative Prompting in Stable Diffusion from Scratch | fastai Part 2

fastai - Chapter 8 - Collaborative Filtering Deep Dive Code Walkthrough

Implementing a Custom Test Time Augmentation Method using fastai

fastai - Chapter 6 - Building Single, Multi-label Classification and Image Regression Models

Videos

Understanding ColBERT's ivf.pid.pt: Inspecting Intermediate Artifacts from _build_ivf & optimize_ivf

23 views - 3 days ago

Do RAGatouille and ColBERT Produce the Same Index and Retrieval Scores? A Deep Dive Comparison

22 views - 7 days ago

TIL: Understanding LLM Foundry's BinPackCollator (Sequence Packing for 95% Token Efficiency!)

39 views - 8 days ago

Improving LLM Judge Alignment: Enhancing TinyScale Lab Evaluation Agreement to 94%

27 views - 10 days ago

Technical Report Summary: Nomic Embed

37 views - 10 days ago

Understanding Sequence Packing: Initial Musings

48 views - 12 days ago

Building an LLM Judge Agreement App: 7 Iterations from Basic to Full Functionality

30 views - 13 days ago

Evaluating First Attempt LLM Judge Scores: Improving Claude Haiku Alignment for Story Scoring

18 views - 2 weeks ago

Manual Scoring Results for TinyStories Models: Grammar, Reasoning, and Emergent Capabilities

7 views - 2 weeks ago

Look at Your Data: Building an LM Scoring App with FastHTML

30 views - 2 weeks ago

TSL: Curating Evaluation Prompts, Defining Scoring Criteria + Designing LLM Judge Prompt Template

20 views - 2 weeks ago

TinyScale Lab Update: Setting Eval Targets + Generation Completions for LLM Judge Development

9 views - 2 weeks ago