vishal

@vishal_learner - 47 本の動画

チャンネル登録者数 121人

Machine Learning.

最近の動画

Understanding ColBERT's ivf.pid.pt: Inspecting Intermediate Artifacts from _build_ivf & optimize_ivf 50:20

Understanding ColBERT's ivf.pid.pt: Inspecting Intermediate Artifacts from _build_ivf & optimize_ivf

Do RAGatouille and ColBERT Produce the Same Index and Retrieval Scores? A Deep Dive Comparison 19:13

Do RAGatouille and ColBERT Produce the Same Index and Retrieval Scores? A Deep Dive Comparison

TIL: Understanding LLM Foundry's BinPackCollator (Sequence Packing for 95% Token Efficiency!) 19:12

TIL: Understanding LLM Foundry's BinPackCollator (Sequence Packing for 95% Token Efficiency!)

Improving LLM Judge Alignment: Enhancing TinyScale Lab Evaluation Agreement to 94% 23:06

Improving LLM Judge Alignment: Enhancing TinyScale Lab Evaluation Agreement to 94%

Technical Report Summary: Nomic Embed 26:38

Technical Report Summary: Nomic Embed

Understanding Sequence Packing: Initial Musings 21:17

Understanding Sequence Packing: Initial Musings

Building an LLM Judge Agreement App: 7 Iterations from Basic to Full Functionality 15:06

Building an LLM Judge Agreement App: 7 Iterations from Basic to Full Functionality

Evaluating First Attempt LLM Judge Scores: Improving Claude Haiku Alignment for Story Scoring 44:00

Evaluating First Attempt LLM Judge Scores: Improving Claude Haiku Alignment for Story Scoring

Manual Scoring Results for TinyStories Models: Grammar, Reasoning, and Emergent Capabilities 36:39

Manual Scoring Results for TinyStories Models: Grammar, Reasoning, and Emergent Capabilities

Look at Your Data: Building an LM Scoring App with FastHTML 40:23

Look at Your Data: Building an LM Scoring App with FastHTML

TSL: Curating Evaluation Prompts, Defining Scoring Criteria + Designing LLM Judge Prompt Template 30:15

TSL: Curating Evaluation Prompts, Defining Scoring Criteria + Designing LLM Judge Prompt Template

TinyScale Lab Update: Setting Eval Targets + Generation Completions for LLM Judge Development 19:41

TinyScale Lab Update: Setting Eval Targets + Generation Completions for LLM Judge Development

TinyScaleLab Project Update: Training Cost Analysis and Evaluation Infrastructure Plans 9:24

TinyScaleLab Project Update: Training Cost Analysis and Evaluation Infrastructure Plans

TinyScale Lab: Exploring the Connection Between Training Dynamics and Model Capabilities 26:51

TinyScale Lab: Exploring the Connection Between Training Dynamics and Model Capabilities

Research Paper Summary: Small-scale proxies for large-scale Transformer training instabilities 1:04:03

Research Paper Summary: Small-scale proxies for large-scale Transformer training instabilities

My Second-Place Winning Tiny Model Hackathon Journey: Pre-Training from Scratch 22:47

My Second-Place Winning Tiny Model Hackathon Journey: Pre-Training from Scratch

LossInspector: A Deep Dive Into LLM-Foundry's Next-Token Prediction with a Custom Composer Callback 21:19

LossInspector: A Deep Dive Into LLM-Foundry's Next-Token Prediction with a Custom Composer Callback

The Evolution of Matrix Multiplication: 12,000x Numba Speedup 🚀 | fastai Course Lesson 11 28:41

The Evolution of Matrix Multiplication: 12,000x Numba Speedup 🚀 | fastai Course Lesson 11

Research Paper Summary: TinyStories 1:23:34

Research Paper Summary: TinyStories

Look at Your Data: Manual Validation of Retrieval Metrics 47:38

Look at Your Data: Manual Validation of Retrieval Metrics

Creating a Custom Composer Callback to Track Data Types in LLM Training | Mixed Precision Deep Dive 46:10

Creating a Custom Composer Callback to Track Data Types in LLM Training | Mixed Precision Deep Dive

Paper Reading: Small-scale proxies for large-scale Transformer training instabilities 1:16:36

Paper Reading: Small-scale proxies for large-scale Transformer training instabilities

Paper Reading: Overtrained Language Models Are Harder to Fine-Tune 1:36:58

Paper Reading: Overtrained Language Models Are Harder to Fine-Tune

Paper Reading: SmolLM2 55:01

Paper Reading: SmolLM2

Exploring Sequential and Merged Linear Layer Forward Passes 20:16

Exploring Sequential and Merged Linear Layer Forward Passes

TIL: Using PyTorch's register_forward_hook to Trace Floating Point Errors 9:21

TIL: Using PyTorch's register_forward_hook to Trace Floating Point Errors

Debugging Un-Merged and Merged LoRA Model Output Differences 26:21

Debugging Un-Merged and Merged LoRA Model Output Differences

LoraModel.merge_and_unload Deep Dive 18:10

LoraModel.merge_and_unload Deep Dive

RAGatouille/ColBERT Indexing Deep Dive 1:05:43

RAGatouille/ColBERT Indexing Deep Dive

Recreating Plots from Appendix A.4 of the DoRA Paper for LoRA Learns Less and Forgets Less Models 25:43

Recreating Plots from Appendix A.4 of the DoRA Paper for LoRA Learns Less and Forgets Less Models

TIL: PeftModel Base Model Behavior 17:43

TIL: PeftModel Base Model Behavior

Research Paper Summary: Hypencoder: Hypernetworks for Information Retrieval 41:51

Research Paper Summary: Hypencoder: Hypernetworks for Information Retrieval

Code Walkthrough - peft DoRA Implementation 26:53

Code Walkthrough - peft DoRA Implementation

Research Paper Summary: rsLoRA 19:42

Research Paper Summary: rsLoRA

Research Paper Summary: LoRA Learns Less and Forgets Less 36:57

Research Paper Summary: LoRA Learns Less and Forgets Less

Recreating the PLAID ColBERTv2 Scoring Pipeline: From Research Code to RAGatouille 1:14:59

Recreating the PLAID ColBERTv2 Scoring Pipeline: From Research Code to RAGatouille

fastbook-benchmark: ColBERT Search 11:48

fastbook-benchmark: ColBERT Search

fastbook-benchmark: Single Vector Search 18:09

fastbook-benchmark: Single Vector Search

fastbook-benchmark: Scoring Retrieval Results 20:36

fastbook-benchmark: Scoring Retrieval Results

fastbook-benchmark: Full Text Search Implementation 16:37

fastbook-benchmark: Full Text Search Implementation

fastbook-benchmark: Document Processing 17:01

fastbook-benchmark: Document Processing

Introducing the fastbook-benchmark Information Retrieval QA Dataset 4:27

Introducing the fastbook-benchmark Information Retrieval QA Dataset

Implementing Image to Image Generation in Stable Diffusion from Scratch | fastai Part 2 21:27

Implementing Image to Image Generation in Stable Diffusion from Scratch | fastai Part 2

Implementing Negative Prompting in Stable Diffusion from Scratch | fastai Part 2 37:34

Implementing Negative Prompting in Stable Diffusion from Scratch | fastai Part 2

fastai - Chapter 8 - Collaborative Filtering Deep Dive Code Walkthrough 38:55

fastai - Chapter 8 - Collaborative Filtering Deep Dive Code Walkthrough

Implementing a Custom Test Time Augmentation Method using fastai 25:12

Implementing a Custom Test Time Augmentation Method using fastai

fastai - Chapter 6 - Building Single, Multi-label Classification and Image Regression Models 2:41:31

fastai - Chapter 6 - Building Single, Multi-label Classification and Image Regression Models

動画

Understanding ColBERT's ivf.pid.pt: Inspecting Intermediate Artifacts from _build_ivf & optimize_ivf 50:20

Understanding ColBERT's ivf.pid.pt: Inspecting Intermediate Artifacts from _build_ivf & optimize_ivf

23 回視聴 - 3 日前

Do RAGatouille and ColBERT Produce the Same Index and Retrieval Scores? A Deep Dive Comparison 19:13

Do RAGatouille and ColBERT Produce the Same Index and Retrieval Scores? A Deep Dive Comparison

22 回視聴 - 7 日前

TIL: Understanding LLM Foundry's BinPackCollator (Sequence Packing for 95% Token Efficiency!) 19:12

TIL: Understanding LLM Foundry's BinPackCollator (Sequence Packing for 95% Token Efficiency!)

39 回視聴 - 8 日前

Improving LLM Judge Alignment: Enhancing TinyScale Lab Evaluation Agreement to 94% 23:06

Improving LLM Judge Alignment: Enhancing TinyScale Lab Evaluation Agreement to 94%

27 回視聴 - 10 日前

Technical Report Summary: Nomic Embed 26:38

Technical Report Summary: Nomic Embed

37 回視聴 - 10 日前

Understanding Sequence Packing: Initial Musings 21:17

Understanding Sequence Packing: Initial Musings

48 回視聴 - 12 日前

Building an LLM Judge Agreement App: 7 Iterations from Basic to Full Functionality 15:06

Building an LLM Judge Agreement App: 7 Iterations from Basic to Full Functionality

30 回視聴 - 13 日前

Evaluating First Attempt LLM Judge Scores: Improving Claude Haiku Alignment for Story Scoring 44:00

Evaluating First Attempt LLM Judge Scores: Improving Claude Haiku Alignment for Story Scoring

18 回視聴 - 2 週間前

Manual Scoring Results for TinyStories Models: Grammar, Reasoning, and Emergent Capabilities 36:39

Manual Scoring Results for TinyStories Models: Grammar, Reasoning, and Emergent Capabilities

7 回視聴 - 2 週間前

Look at Your Data: Building an LM Scoring App with FastHTML 40:23

Look at Your Data: Building an LM Scoring App with FastHTML

30 回視聴 - 2 週間前

TSL: Curating Evaluation Prompts, Defining Scoring Criteria + Designing LLM Judge Prompt Template 30:15

TSL: Curating Evaluation Prompts, Defining Scoring Criteria + Designing LLM Judge Prompt Template

20 回視聴 - 2 週間前

TinyScale Lab Update: Setting Eval Targets + Generation Completions for LLM Judge Development 19:41

TinyScale Lab Update: Setting Eval Targets + Generation Completions for LLM Judge Development

9 回視聴 - 2 週間前