Exploring ONNX, Embedding Models, and Retrieval Augmented Generation (RAG) with Langchain4j

A conversation with Dmytro Liubarskyi about ONNX, RAG, Quarkus, Langchain4j and MicroProfile
1 hour 9 minutes
Podcast: Java, Serverless, Clouds, Architecture and Web conversations with Adam Bien

Description

1 year ago
An airhacks.fm conversation with Dmytro Liubarskyi (@langchain4j)
about: Dmytro's previous appearance in "#285 How LangChain4j Happened",
the ONNX format and runtime for running neural network
models in Java, using the LangChain4j library for seamless integration
and data handling, embedding models for converting text into vector
representations, strategies for handling longer text inputs by
splitting them and averaging the embeddings, an overview of the retrieval
augmented generation (RAG) pipeline and its components, using
embeddings for query transformation, routing, and data source
selection in RAG, integrating LangChain4j with Quarkus and CDI for
building AI-powered applications, the pre-packaged ONNX models that
LangChain4j provides as Maven dependencies, embedding models being
faster and smaller than full language models, using
embeddings for query expansion, summarization, and data source
selection, cross-checking model outputs using embeddings or another
language model, decomposing complex AI services into smaller,
specialized sub-modules, injecting the right tools and data based
on query classification
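
The "splitting and averaging embeddings" strategy mentioned above can be illustrated with a minimal plain-Java sketch. It does not use the LangChain4j API and is not claimed to match its implementation: the class name AveragedEmbedding, the word-based splitter, the unweighted mean, and the toyEmbed function (a stand-in for a real embedding model) are all assumptions made for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.function.Function;

public class AveragedEmbedding {

    // Splits text into chunks of at most maxWords whitespace-separated words.
    static List<String> split(String text, int maxWords) {
        List<String> chunks = new ArrayList<>();
        if (text.isBlank()) {
            return chunks;
        }
        String[] words = text.trim().split("\\s+");
        for (int i = 0; i < words.length; i += maxWords) {
            int end = Math.min(i + maxWords, words.length);
            chunks.add(String.join(" ", Arrays.copyOfRange(words, i, end)));
        }
        return chunks;
    }

    // Computes the unweighted mean of the vectors, then scales it to unit length.
    static float[] average(List<float[]> vectors) {
        if (vectors.isEmpty()) {
            throw new IllegalArgumentException("no vectors to average");
        }
        int dim = vectors.get(0).length;
        float[] mean = new float[dim];
        for (float[] v : vectors) {
            if (v.length != dim) {
                throw new IllegalArgumentException("dimension mismatch");
            }
            for (int i = 0; i < dim; i++) {
                mean[i] += v[i] / vectors.size();
            }
        }
        return normalize(mean);
    }

    // Returns a unit-length copy of v (or an unchanged copy if v is all zeros).
    static float[] normalize(float[] v) {
        double norm = 0;
        for (float x : v) {
            norm += x * x;
        }
        norm = Math.sqrt(norm);
        if (norm == 0) {
            return v.clone();
        }
        float[] out = new float[v.length];
        for (int i = 0; i < v.length; i++) {
            out[i] = (float) (v[i] / norm);
        }
        return out;
    }

    // Embeds each chunk separately and averages the per-chunk vectors.
    static float[] embedLongText(String text, int maxWords,
                                 Function<String, float[]> embedder) {
        List<float[]> vectors = new ArrayList<>();
        for (String chunk : split(text, maxWords)) {
            vectors.add(embedder.apply(chunk));
        }
        return average(vectors);
    }

    // Hypothetical stand-in for a real model: counts vowels, other letters, and the rest.
    static float[] toyEmbed(String chunk) {
        float[] v = new float[3];
        for (char c : chunk.toLowerCase().toCharArray()) {
            if ("aeiou".indexOf(c) >= 0) {
                v[0]++;
            } else if (Character.isLetter(c)) {
                v[1]++;
            } else {
                v[2]++;
            }
        }
        return v;
    }

    public static void main(String[] args) {
        String longText = "Embedding models accept a limited number of tokens, "
                + "so longer inputs are split and embedded chunk by chunk.";
        float[] embedding = embedLongText(longText, 5, AveragedEmbedding::toyEmbed);
        System.out.println(Arrays.toString(embedding));
    }
}
```

In a real application the embedder argument would wrap an actual embedding model, and the chunk size would follow that model's token limit rather than a word count. Weighting each chunk by its length is a possible variation that this sketch leaves out.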

Dmytro Liubarskyi on Twitter: @langchain4j
