Exploring ONNX, Embedding Models, and Retrieval Augmented Generation (RAG) with Langchain4j
A conversation with Dmytro Liubarskyi about ONNX, RAG, Quarkus,
Langchain4j and MicroProfile
1 Stunde 9 Minuten
Podcast
Podcaster
Java, Serverless, Clouds, Architecture and Web conversations with Adam Bien
Beschreibung
vor 1 Jahr
An airhacks.fm conversation with Dmytro Liubarskyi (@langchain4j)
about: Dmytro previously on "#285 How LangChain4j Happened",
discussion about ONNX format and runtime for running neural network
models in Java, using langchain4j library for seamless integration
and data handling, embedding models for converting text into vector
representations, strategies for handling longer text inputs by
splitting and averaging embeddings, overview of the retrieval
augmented generation (RAG) pipeline and its components, using
embeddings for query transformation, routing, and data source
selection in RAG, integrating Langchain4j with quarkus and CDI for
building AI-powered applications, Langchain4j provides pre-packaged
ONNX models as Maven dependencies, embedding models are faster and
smaller compared to full language models, possibilities of using
embeddings for query expansion, summarization, and data source
selection, cross-checking model outputs using embeddings or another
language model, decomposing complex AI services into smaller,
specialized sub-modules, injecting the right tools and data based
on query classification
Dmytro Liubarskyi on twitter: @langchain4j
about: Dmytro previously on "#285 How LangChain4j Happened",
discussion about ONNX format and runtime for running neural network
models in Java, using langchain4j library for seamless integration
and data handling, embedding models for converting text into vector
representations, strategies for handling longer text inputs by
splitting and averaging embeddings, overview of the retrieval
augmented generation (RAG) pipeline and its components, using
embeddings for query transformation, routing, and data source
selection in RAG, integrating Langchain4j with quarkus and CDI for
building AI-powered applications, Langchain4j provides pre-packaged
ONNX models as Maven dependencies, embedding models are faster and
smaller compared to full language models, possibilities of using
embeddings for query expansion, summarization, and data source
selection, cross-checking model outputs using embeddings or another
language model, decomposing complex AI services into smaller,
specialized sub-modules, injecting the right tools and data based
on query classification
Dmytro Liubarskyi on twitter: @langchain4j
Weitere Episoden
1 Stunde 12 Minuten
vor 7 Monaten
1 Stunde 6 Minuten
vor 7 Monaten
57 Minuten
vor 8 Monaten
1 Stunde 5 Minuten
vor 8 Monaten
1 Stunde 13 Minuten
vor 8 Monaten
In Podcasts werben
Kommentare (0)