Sumit @_reachsumit profile

Sumit

@_reachsumit

Followers

2,491

Following

423

Media

1,586

Statuses

6,848

Senior ML Engineer @Meta | prev: @TikTok_us , @Amazon , @Samsung | UChicago Alum 🇮🇳→🇰🇷→🇦🇺→🇨🇦→🇺🇲

https://t.co/OHPOnaN5yM

Seattle, WA

Joined April 2010

Don't wanna be here? Send us removal request.

Explore tweets Explore followers Explore following

Explore trending content on Musk Viewer

Errejón • 224716 Tweets

Mario • 112068 Tweets

الزمالك • 58645 Tweets

Sumar • 56097 Tweets

Mourinho • 42592 Tweets

Mudryk • 37902 Tweets

Lyon • 37094 Tweets

Onana • 32143 Tweets

#الاتحاد_الرياض • 30925 Tweets

McConnell • 25869 Tweets

Guanajuato • 24665 Tweets

Mitch • 23575 Tweets

Ten Hag • 22937 Tweets

السوبر المصري • 16063 Tweets

Mazraoui • 14320 Tweets

Abdelaziz Barrada • 14292 Tweets

Eriksen • 13695 Tweets

#FBvMUN • 12901 Tweets

Ugarte • 12660 Tweets

Amad • 12173 Tweets

Zirkzee • 12084 Tweets

#18YearsOfTaylorSwift • 11947 Tweets

Cerny

Maximin

لوران بلان

زياد الصحفي

Szymanski

العمري

زيزو

للأهلي

Puka

Osayi

Ersin

neil gaiman

En Nesyri

بنزيما

Haberimiz Olsun

الشناوي

Maguila

Dalot

Tadic

Muçi

連載マンガ

Werner

Semih

Lindelof

Mikey Moore

كانتي

#نواف_العقيدي

#سباق_مياه_فجر

Last Seen Profiles

@kimja_ming

@OoooDean16002

@SUZU__ota2

@Patrik_Rucki

@DholayareM8640

@MartinaHes78693

@torres_barbara7

@SalemRhose

@saychess1

@Mathis_glb

@Amr32125488

@snx_nft

@RarefiedLevin

@turk_ifsa2019

@AdnannBali

@HanexM23151

@kmonasa

@reddit_nba

@claims_lawyer

@ArdKesl

Pinned Tweet

Sumit

@_reachsumit

10 months

The previous article introduced the prompting-based techniques to exploit LLMs as text rerankers. In this article, we take a closer look at associated challenges and some of the potential improvements to make these methods more ranking-aware.

Strategies for Effective and Efficient Text Ranking Using Large Language Models

The previous article did a deep dive into the prompting-based pointwise, pairwise, and listwise techniques that directly use LLMs to perform reranking. In this article, we will take a closer look at...

blog.reachsumit.com

2

1

25

Sumit

@_reachsumit

8 months

Is Cosine-Similarity of Embeddings Really About Similarity? Netflix cautions against blindly using cosine similarity as a measure of semantic similarity between learned embeddings, as it can yield arbitrary and meaningless results. 📝

28

395

2K

Sumit

@_reachsumit

5 months

RAG Does Not Work for Enterprises Explores the challenges and requirements for implementing RAG in enterprises proposing potential solutions like semantic search and hybrid queries, and an evaluation framework to validate enterprise-grade RAG solutions 📝

19

148

808

Sumit

@_reachsumit

9 months

Foundations of Vector Retrieval This 185-page monograph provides a summary of major algorithmic milestones in the vector retrieval literature, with the goal of serving as a self-contained reference for new and established researchers. 📝

4

75

369

Sumit

@_reachsumit

3 months

REAPER: Reasoning based Retrieval Planning for Complex RAG Systems Amazon presents an LLM-based planner for generating efficient retrieval plans in conversational AI systems offering reduced latency, higher accuracy, and easy scalability. 📝

5

56

214

Sumit

@_reachsumit

3 months

A Comprehensive Survey of LLM Alignment Techniques: RLHF, RLAIF, PPO, DPO and More Salesforces presents a survey of LLM alignment methods, categorizing approaches into four main topics and identifying future research directions. 📝

1

55

200

Sumit

@_reachsumit

1 month

Retrieval Augmented Generation (RAG) and Beyond: A Comprehensive Survey on How to Make your LLMs use External Data More Wisely Microsoft categorizes data-augmented LLM queries and proposes strategies to tackle challenges in specialized domains. 📝

3

58

201

Sumit

@_reachsumit

8 months

ColBERT-XM: A Modular Multi-Vector Representation Model for Zero-Shot Multilingual Information Retrieval Introduces a multilingual dense retrieval model that achieves zero-shot transfer to other languages. 📝 👨🏽‍💻

1

40

199

Sumit

@_reachsumit

4 months

A Survey on Mixture of Experts Provides a comprehensive review of MoE models in LLMs, introducing a new taxonomy and covering algorithmic advancements, system designs, and applications. 📝 👨🏽‍💻

0

51

198

Sumit

@_reachsumit

9 months

Large Language Models: A Survey This paper surveys recent advances in large language models, including prominent models like GPT, LLaMA and PaLM, their construction, capabilities, applications, benchmarks, and open challenges. 📝

2

60

179

Sumit

@_reachsumit

5 months

CRAG -- Comprehensive RAG Benchmark Meta presents a factual QA benchmark with 4,409 diverse questions, mock APIs, and realistic challenges, designed to evaluate RAG systems for LLMs. 📝 👨🏽‍💻

1

38

178

Sumit

@_reachsumit

2 months

In Defense of RAG in the Era of Long-Context Language Models NVIDIA argues that long-context LLMs can be overwhelmed by irrelevant information, and proposes an order-preserving RAG method that retrieves relevant context chunks first. 📝

1

39

167

Sumit

@_reachsumit

10 days

Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free Combines routing weights and hidden states from MoE LLMs to create superior embeddings without additional training. 📝 👨🏽‍💻

2

47

170

Sumit

@_reachsumit

2 months

rerankers: A Lightweight Python Library to Unify Ranking Methods Introduces a Python library that simplifies the use of various re-ranking methods in information retrieval by providing a unified, easy-to-use interface. 📝 👨🏽‍💻

1

23

166

Sumit

@_reachsumit

3 months

How Can Recommender Systems Benefit from Large Language Models: A Survey Examines the integration of LLMs into RecSys, exploring where and how LLMs can enhance various stages of the recommendation pipeline. 📝 👨🏽‍💻

0

42

164

Sumit

@_reachsumit

4 months

Efficient Document Ranking with Learnable Late Interactions Google introduces a new learnable late-interaction model for query-document relevance that outperforms existing models in accuracy while reducing latency and storage costs. 📝

0

25

164

Sumit

@_reachsumit

4 months

BM25S: Orders of magnitude faster lexical search via eager sparse scoring Introduces a fast Python implementation of BM25 that pre-computes scores during indexing using sparse matrices to achieve significant speed improvements 📝 👨🏽‍💻

0

26

161

Sumit

@_reachsumit

5 months

MUVERA: Multi-Vector Retrieval via Fixed Dimensional Encodings Google proposes a retrieval mechanism that reduces multi-vector retrieval to single-vector retrieval by constructing Fixed Dimensional Encodings of a multi-vector representation. 📝

2

38

161

Sumit

@_reachsumit

2 months

HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction Combines VectorRAG and GraphRAG techniques to improve information extraction from financial documents. 📝

1

40

160

Sumit

@_reachsumit

4 months

FACTS About Building Retrieval Augmented Generation-based Chatbots NVIDIA introduces a framework and 15 RAG pipeline control points for building effective enterprise chatbots, providing empirical insights on LLM performance tradeoffs. 📝

0

40

158

Sumit

@_reachsumit

5 months

GRAG: Graph Retrieval-Augmented Generation Enhances LLMs' generation capabilities in graph contexts by efficiently retrieving relevant textual subgraphs and integrating them through dual prompting. 📝

1

33

158

Sumit

@_reachsumit

8 months

A Systematic Survey of Prompt Engineering in Large Language Models: Techniques and Applications This survey categorizes and analyzes 29 prompt engineering techniques for adapting LLMs across tasks without retraining & also highlights several challenges 📝

1

38

155

Sumit

@_reachsumit

1 month

RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval Accelerates attention computation in LLMs by using a vector search-based approach to retrieve key-value pairs from CPU memory. 📝 👨🏽‍💻

1

29

156

Sumit

@_reachsumit

6 months

RAGCache: Efficient Knowledge Caching for Retrieval-Augmented Generation Presents a novel multilevel dynamic caching system that efficiently caches and shares intermediate states of retrieved documents in RAG for LLMs. 📝

0

37

151

Sumit

@_reachsumit

2 months

StructuredRAG: JSON Response Formatting with Large Language Models @CShorten30 et al. introduce a benchmark for evaluating LLMs' ability to generate structured JSON outputs, revealing varied performance across tasks and models 📝 👨🏽‍💻

3

36

153

Sumit

@_reachsumit

5 months

A Survey on RAG Meets LLMs: Towards Retrieval-Augmented Large Language Models Comprehensively reviews RA-LLMs, covering their architectures, training methods, limitations, future directions, and applications in enhancing LLM generation capabilities. 📝

0

43

152

Sumit

@_reachsumit

3 months

PersonaRAG: Enhancing Retrieval-Augmented Generation Systems with User-Centric Agents Enhances RAG models by incorporating user-centric agents, adapting retrieval and generation based on real-time user data. 📝 👨🏽‍💻

0

38

152

Sumit

@_reachsumit

5 months

Large Language Models Meet NLP: A Survey Provides a comprehensive survey of how LLMs are applied to NLP tasks, introducing a new taxonomy and discussing current progress, future frontiers, and challenges. 📝 👨🏽‍💻

1

43

151

Sumit

@_reachsumit

2 months

A Guide to Similarity Measures Provides a comprehensive guide to similarity measures used across various data science fields, offering detailed explanations and principles to understand, select, and design appropriate measures for diverse applications. 📝

1

39

143

Sumit

@_reachsumit

4 months

On LLMs-Driven Synthetic Data Generation, Curation, and Evaluation: A Survey Provides a comprehensive review of LLM-driven synthetic data generation, organizing existing studies into a unified framework of generation, curation, and evaluation. 📝

0

38

143

Sumit

@_reachsumit

10 months

User Embedding Model for Personalized Language Prompting Google Research proposes a User Embedding Module to compress free-text user histories into embeddings for prompting Large Language Models to improve recommendation accuracy. 📝

0

24

139

Sumit

@_reachsumit

6 months

SPLATE: Sparse Late Interaction Retrieval Adapts the ColBERTv2 model to map its embeddings to a sparse space, enabling efficient sparse retrieval for candidate generation in the late interaction paradigm. 📝

3

20

143

Sumit

@_reachsumit

10 months

Searching, fast and slow, through product catalogs Microsoft presents a fast and accurate SKU search system for CRMs combining Trie-based suggestions, TF-IDF retrieval, and language model embeddings, outperforming existing systems. 📝

1

22

142

Sumit

@_reachsumit

3 months

Speculative RAG: Enhancing Retrieval Augmented Generation through Drafting Google improves RAG systems by using a smaller model to generate multiple draft answers from partitioned document subsets, which are then verified by a larger model. 📝

1

29

142

Sumit

@_reachsumit

9 months

Health-LLM: Personalized Retrieval-Augmented Disease Prediction Model Proposes a framework integrating LLMs and medical expertise to enhance exploitation of health reports for disease prediction and preventative care. 📝 👨🏽‍💻

6

42

141

Sumit

@_reachsumit

17 days

TableRAG: Million-Token Table Understanding with Language Models Enables efficient large-scale table understanding for language models, using smart retrieval techniques to overcome context length limitations, while reducing token consumption. 📝

1

39

142

Sumit

@_reachsumit

5 months

Multi-Head RAG: Solving Multi-Aspect Problems with LLMs Enhances the retrieval accuracy of LLMs for complex multi-aspect queries by leveraging activations from the multi-head attention layer as embeddings. 📝 👨🏽‍💻

2

37

135

Sumit

@_reachsumit

14 days

TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text Pre-computes document KV caches offline, significantly reducing time-to-first-token and computational resource use. 📝 👨🏽‍💻

0

31

138

Sumit

@_reachsumit

2 months

Understanding the User: An Intent-Based Ranking Dataset Presents a new dataset that annotates multiple intents for complex queries from TREC-DL, using LLMs and crowdsourcing. 📝 👨🏽‍💻

1

33

136

Sumit

@_reachsumit

6 months

A Survey on Retrieval-Augmented Text Generation for Large Language Models Presents a comprehensive framework for understanding RAG, outlining its core components, evaluation methods, and future research directions. 📝

0

32

133

Sumit

@_reachsumit

3 months

From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future Examines the use of LLMs and agents in software engineering, covering six key areas and analyzing their differences, applications, and effectiveness. 📝

1

31

132

Sumit

@_reachsumit

1 month

Late Chunking: Contextual Chunk Embeddings Using Long-Context Embedding Models Jina AI presents a technique that improves text embeddings for retrieval tasks by encoding entire documents before splitting them. 📝 👨🏽‍💻

0

16

132

Sumit

@_reachsumit

2 months

CRAFT Your Dataset: Task-Specific Synthetic Dataset Generation Through Corpus Retrieval and Augmentation Presents a method for generating task-specific synthetic datasets using user-provided few-shot examples. 📝 👨🏽‍💻

0

30

130

Sumit

@_reachsumit

11 months

Dense X Retrieval: What Retrieval Granularity Should We Use? Proposes proposition-based retrieval, which outperforms passages and sentences by providing compact, factual expressions with context to enhance generalization. 📝 👨🏽‍💻

2

26

129

Sumit

@_reachsumit

1 year

Knowledge-Augmented Large Language Models for Personalized Contextual Query Suggestion Microsoft Research presents a method to personalize LLMs for search via entity-based user knowledge stores derived from logs. 📝

0

26

126

Sumit

@_reachsumit

1 year

Large Search Model: Redefining Search Stack in the Era of LLMs Microsoft proposes using a single large language model for all search tasks instead of many specialized models, formulating tasks as text generation from prompts. 📝

4

23

125

Sumit

@_reachsumit

1 year

RecMind: Large Language Model Powered Agent For Recommendation Introduces an autonomous recommender agent powered by LLMs, which leverages planning and external tools like Self-Inspiring (SI) to provide personalized recommendations. 📝

1

26

125

Sumit

@_reachsumit

10 months

A recent research direction has explored directly prompting LLMs to perform unsupervised ranking using pointwise, pairwise, or listwise techniques. Some of these techniques even surpass the performance of state-of-the-art supervised systems.

Prompting-based Methods for Text Ranking Using Large Language Models

Large Language Models (LLMs) have demonstrated impressive zero-shot performance on a wide variety of NLP tasks. Recently, there has been a growing interest in applying LLMs to zero-shot text ranking....

blog.reachsumit.com

11

123

Sumit

@_reachsumit

7 months

Enhancing LLM Factual Accuracy with RAG to Counter Hallucinations: A Case Study on Domain-Specific Queries in Private Knowledge-Bases Presents a system that uses RAG and a curated dataset to improve factual accuracy of LLMs 📝 👨🏽‍💻

0

39

126

Sumit

@_reachsumit

1 month

Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models Proposes a retrieval model that follows natural language instructions, enabling more flexible, and user-friendly search experiences. 📝 👨🏽‍💻

1

33

125

Sumit

@_reachsumit

8 months

Self-Retrieval: Building an Information Retrieval System with One Large Language Model Proposes an end-to-end, LLM-based IR system that leverages the LLM's capabilities for indexing, retrieval, and self-assessment. 📝 👨🏽‍💻

2

23

123

Sumit

@_reachsumit

5 months

FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research Presents an open-source toolkit that provides a modular framework, pre-implemented RAG algorithms, benchmark datasets, and auxiliary scripts. 📝 👨🏽‍💻

0

45

120

Sumit

@_reachsumit

3 months

Retrieval Augmented Generation or Long-Context LLMs? A Comprehensive Study and Hybrid Approach Google DeepMind compares RAG and long-context LLMs, finding LC outperforms but at a higher cost. Proposes a method to dynamically choose between RAG and LC. 📝

2

34

124

Sumit

@_reachsumit

5 months

Evaluation of Retrieval-Augmented Generation: A Survey Presents an analysis framework to systematically evaluate RAG systems by considering retrieval accuracy, generation quality, and additional factors 📝 👨🏽‍💻

1

34

124

Sumit

@_reachsumit

2 months

Graph Retrieval-Augmented Generation: A Survey Presents the first comprehensive survey of GraphRAG, detailing its workflow, technologies, applications, and future directions in improving information retrieval and generation. 📝

0

46

123

Sumit

@_reachsumit

10 days

Agentic Information Retrieval Proposes a paradigm using LLM agents to expand and transform traditional information retrieval, offering a unified, flexible approach to complex information tasks. 📝

0

36

125

Sumit

@_reachsumit

9 months

ResumeFlow: An LLM-facilitated Pipeline for Personalized Resume Generation and Refinement Presents a tool powered by LLMs like GPT-4 and Gemini that tailors resumes to specific job postings. 📝 👨🏽‍💻

1

26

121

Sumit

@_reachsumit

8 months

G-Retriever: Retrieval-Augmented Generation for Textual Graph Understanding and Question Answering Introduces a graph QA framework combining LLMs and GNNs with a retrieval method to enable conversing with textual graphs. 📝 👨🏽‍💻

1

27

121

Sumit

@_reachsumit

4 months

BERGEN: A Benchmarking Library for Retrieval-Augmented Generation Naver introduces a Python library for standardizing RAG experiments and reveals key insights through extensive benchmarking. 📝 👨🏽‍💻

0

31

122

Sumit

@_reachsumit

6 months

Fast Exact Retrieval for Nearest-neighbor Lookup (FERN) Proposes an algorithm for fast exact vector retrieval, inspired by kd-trees, that achieves logarithmic time complexity with 100% recall for high-dimensional vectors. 📝 👨🏽‍💻

1

26

122

Sumit

@_reachsumit

7 months

Adaptive-RAG: Learning to Adapt Retrieval-Augmented Large Language Models through Question Complexity Dynamically selects the most suitable retrieval-augmented strategy based on the predicted complexity level of input query 📝 👨🏽‍💻

3

30

115

Sumit

@_reachsumit

3 months

A Survey of Mamba Reviews Mamba architecture, highlighting its comparable modeling abilities to Transformers but with near-linear scalability for sequence length, and discusses its advancements, data adaptability, applications, and limitations. 📝

0

30

119

Sumit

@_reachsumit

5 months

xRAG: Extreme Context Compression for Retrieval-augmented Generation with One Token Proposes a context compression method that reinterprets document embeddings as retrieval modality features and integrates them into LMs. 📝 👨🏽‍💻

1

29

119

Sumit

@_reachsumit

3 months

A Comprehensive Review of Recommender Systems: Transitioning from Theory to Practice Examines Recommender Systems from 2017 to 2024, bridging theory and practice across various sectors, exploring advanced techniques, and addressing industry challenges. 📝

1

26

119

Sumit

@_reachsumit

9 months

In-context Learning with Retrieved Demonstrations for Language Models: A Survey Google Research offers a comprehensive analysis of retrieval-based ICL, highlighting key innovations and future paths to enhance demonstration relevance and diversity. 📝

0

18

115

Sumit

@_reachsumit

7 months

Efficient Multi-Vector Dense Retrieval Using Bit Vectors Introduces techniques like optimized bit vector filtering, SIMD-based centroid interaction, product quantization, and per-document term filtering. 📝 👨🏽‍💻

2

27

113

Sumit

@_reachsumit

10 months

Seven Failure Points When Engineering a Retrieval Augmented Generation System Investigates failure points of RAG systems through case studies, finding robustness emerges over time rather than from upfront design, and proposes future research directions 📝

0

21

107

Sumit

@_reachsumit

3 months

Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together Presents a method for optimizing multi-stage NLP pipelines by alternating between prompt optimization and LM weight fine-tuning. 📝 👨🏽‍💻

1

30

107

Sumit

@_reachsumit

5 months

AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings Enables any-granularity ranking using multi-vector embeddings with a coarser encoding level, and introduces a multi-granular contrastive loss to improve fine-grained ranking performance. 📝

0

17

105

Sumit

@_reachsumit

3 months

RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework Presents a framework for generating domain-specific datasets to evaluate RAG systems. Focuses on vertical domains and introduces new metrics to assess LLMs' knowledge usage. 📝

0

22

105

Sumit

@_reachsumit

3 months

Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning Tsinghua University improves text embeddings in smaller language models (MiniCPM, Phi-2, and Gemma) through contrastive fine-tuning. 📝 👨🏽‍💻

2

29

105

Sumit

@_reachsumit

6 months

Beyond Chain-of-Thought: A Survey of Chain-of-X Paradigms for LLMs Categorizes and examines Chain-of-X (CoX) methods, which generalize the Chain-of-Thought prompting approach to enhance LLM capabilities across various components and application tasks. 📝

2

25

106

Sumit

@_reachsumit

5 months

Positional encoding is not the same as context: A study on positional encoding for Sequential recommendation Huawei analyzes positional encodings in transformer-based sequential recommendation systems, proposing new encodings. 📝 👨🏽‍💻

0

21

105

Sumit

@_reachsumit

2 months

Enhancing Relevance of Embedding-based Retrieval at Walmart Walmart presents techniques to enhance embedding-based neural retrieval for its product search, addressing data quality issues and query misspellings. 📝

1

37

104

Sumit

@_reachsumit

3 months

NV-Retriever: Improving text embedding models with effective hard-negative mining NVIDIA presents positive-aware hard-negative mining for text embedding models, and a model that topped the MTEB Retrieval benchmark in July'24. 📝 👨🏽‍💻

2

22

105

Sumit

@_reachsumit

3 months

EfficientRAG: Efficient Retriever for Multi-Hop Question Answering Presents a retriever for multi-hop QA that iteratively generates queries without using LLMs, outperforming existing RAG methods on three datasets while reducing latency and cost. 📝

0

31

104

Sumit

@_reachsumit

6 months

A Survey of Generative Search and Recommendation in the Era of Large Language Models Present a unified framework for the emerging generative paradigm in search and recommendation that leverages LLMs. 📝

0

18

103

Sumit

@_reachsumit

1 month

What is the Role of Small Models in the LLM Era: A Survey Explores the relationship between LLMs and small models, analyzing their collaborative potential and competitive advantages. 📝 👨🏽‍💻

0

31

103

Sumit

@_reachsumit

3 months

Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions Google presents a framework that optimizes LLM embeddings, reducing dimensionality without compromising performance. 📝

0

29

102

Sumit

@_reachsumit

2 months

Foundation Models for Music: A Survey Reviews the impact of foundation models like LLMs and latent diffusion models on music, highlighting their potential for advancing music understanding, generation, and medical applications 📝 👨🏽‍💻

0

32

102

Sumit

@_reachsumit

2 months

A Survey on Benchmarks of Multimodal Large Language Models Provides a comprehensive review of 180 benchmarks for evaluating Multimodal Large Language Models, categorizing them into five key areas. 📝 👨🏽‍💻

1

34

100

Sumit

@_reachsumit

9 months

Corrective Retrieval Augmented Generation Makes RAG models more robust by self-correcting inaccurate retrievals with confidence scores, web searches, and knowledge refinement, boosting generation accuracy. 📝 👨🏽‍💻

0

21

102

Sumit

@_reachsumit

2 months

RouterRetriever: Exploring the Benefits of Routing over Multiple Expert Embedding Models Introduces a flexible, multi-expert approach to information retrieval that routes queries to domain-specific experts. 📝 👨🏽‍💻

1

20

101

Sumit

@_reachsumit

2 months

Jina-ColBERT-v2: A General-Purpose Multilingual Late Interaction Retriever Enhances the ColBERT multi-vector model for multilingual retrieval, incorporating diverse training data and efficiency improvements. 📝 👨🏽‍💻

1

19

99

Sumit

@_reachsumit

7 months

LLM-Augmented Retrieval: Enhancing Retrieval Models Through Language Models and Doc-Level Embedding Improves the performance of existing retriever models by enriching document embeddings with contextual information. 📝

0

24

98

Sumit

@_reachsumit

6 months

Generative Information Retrieval Evaluation Examines the use of LLMs in two aspects of IR evaluation: leveraging LLMs as evaluation tools, and evaluating LLM-based generative IR systems, while addressing the continued need for human assessment. 📝

2

23

99

Sumit

@_reachsumit

3 months

Exploring Query Understanding for Amazon Product Search Amazon examines the role of query understanding in its Product Search, exploring its impact on ranking features, model evaluation, and proposing a framework, based on a year-long study. 📝

0

22

99

Sumit

@_reachsumit

3 months

Modular RAG: Transforming RAG Systems into LEGO-like Reconfigurable Frameworks Presents a paradigm that breaks down complex RAG systems into flexible, reconfigurable modules and operators. 📝

0

29

99

Sumit

@_reachsumit

3 months

Efficient Retrieval with Learned Similarities Introduces Mixture-of-Logits (MoL) as a universal approximator for learned similarity functions in retrieval tasks, proposing efficient techniques for approximate top-K retrieval. 📝

1

27

98

Sumit

@_reachsumit

9 months

It's About Time: Incorporating Temporality in Retrieval Augmented Language Models Augments neural retrievers with temporal relevance to handle evolving information better, crucial for temporal question answering and fact checking. 📝

0

23

97

Sumit

@_reachsumit

4 months

Re-Ranking Step by Step: Investigating Pre-Filtering for Re-Ranking with Large Language Models Introduces a pre-filtering method for IR systems that uses LLMs and minimal human input to remove irrelevant passages before re-ranking. 📝

0

14

98

Sumit

@_reachsumit

3 months

Golden-Retriever: High-Fidelity Agentic Retrieval Augmented Generation for Industrial Knowledge Base Enhances RAG with reflection-based question augmentation, improving retrieval accuracy by clarifying jargon and context before document retrieval. 📝

1

29

97

Sumit

@_reachsumit

2 months

Learning vs Retrieval: The Role of In-Context Examples in Regression with LLMs Explores in-context learning in large language models, proposing that it combines knowledge retrieval and learning from examples. 📝 👨🏽‍💻

2

18

97

Sumit

@_reachsumit

3 months

Mindful-RAG: A Study of Points of Failure in Retrieval Augmented Generation Identifies key issues in knowledge graph-based RAG systems for LLMs, focusing on question intent and context alignment. 📝

1

29

97

Sumit

@_reachsumit

1 month

Recommendation with Generative Models Offers a comprehensive exploration of generative models in recommender systems, introducing a novel taxonomy and covering system design, evaluation methods, and societal implications. 📝

3

33

96

Sumit

@_reachsumit

5 months

A Survey of Multimodal Large Language Model from A Data-centric Perspective Reviews data collection, processing, and evaluation methods for training and assessing multimodal LLMs, providing a data-centric perspective. 📝 👨🏽‍💻

0

24

95

Sumit

@_reachsumit

1 year

Learning to Retrieve In-Context Examples for Large Language Models Introduces an iterative framework for in-context learning with LLMs, which effectively retrieves high-quality examples to improve learning performance across a diverse set of NLP tasks. 📝

1

18

93

Sumit

@_reachsumit

1 year

Instruction Distillation Makes Large Language Models Efficient Zero-shot Rankers Distills complex pairwise ranking instructions into simpler pointwise instructions to improve the effectiveness of LLMs for zero-shot ranking. 📝 👨🏽‍💻

1

28

94

Sumit

@_reachsumit

3 months

Context Embeddings for Efficient Answer Generation in RAG Speeds up generation time while improving answer quality by compressing multiple contexts into a small number of embeddings, offering flexible compression rates. 📝

0

21

94

Sumit

@_reachsumit

3 months

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation Intel presents a framework that integrates training, inference, evaluation, and more to streamline the development of RAG systems for LLMs. 📝 👨🏽‍💻

0

25

94

Sumit

@_reachsumit

9 months

Multilingual E5 Text Embeddings: A Technical Report Microsoft presents the methodology and evaluations for releasing open-source multilingual E5 text embedding models in over 100 languages. 📝 👨🏽‍💻

0

28

94