LLama2.java: LLM integration with A 100% Pure Java file
A conversation with Alfonso Peterssen about Pure Java LLM Inference
1 Stunde 1 Minute
Podcast
Podcaster
Java, Serverless, Clouds, Architecture and Web conversations with Adam Bien
Beschreibung
vor 1 Jahr
An airhacks.fm conversation with Alfonso Peterssen (@TheMukel)
about: discussion about Alfonso's early programming experience and
participation in the IOI competition, studying computer science and
functional programming with Martin Odersky, internships at Google
and Oracle Labs working on compilers and the Espresso project
implementing a JVM in Java, espresso mentioned in "#208 GraalVM:
Meta Circularity on Different Levels", "#194 GraalVM, Apple Silicon
(M1) and Clouds", "#167 GraalVM and Java 17, Truffle, Espresso and
Native Image" and "#157 The Ingredients of GraalVM", porting LLVM
to pure Java in one class, integrating Large Language Models (LLMs)
in Java by porting the LLAMA model from C to Java, GPU acceleration
with tornadovm, TornadoVM appeared at "#282 TornadoVM, Paravox.ai:
Java, AI, LLMs and Hardware Acceleration", performance of the Java
port being within 10% of the C versions, potential huge
opportunities for integrating AI and LLMs with enterprise Java
systems for use cases like fraud detection, the Java port being a
1,000 line self-contained implementation with no external
dependencies, the need for more resources and support to further
develop the Java LLM integration, the llama2.java project
Alfonso Peterssen on twitter: @TheMukel
about: discussion about Alfonso's early programming experience and
participation in the IOI competition, studying computer science and
functional programming with Martin Odersky, internships at Google
and Oracle Labs working on compilers and the Espresso project
implementing a JVM in Java, espresso mentioned in "#208 GraalVM:
Meta Circularity on Different Levels", "#194 GraalVM, Apple Silicon
(M1) and Clouds", "#167 GraalVM and Java 17, Truffle, Espresso and
Native Image" and "#157 The Ingredients of GraalVM", porting LLVM
to pure Java in one class, integrating Large Language Models (LLMs)
in Java by porting the LLAMA model from C to Java, GPU acceleration
with tornadovm, TornadoVM appeared at "#282 TornadoVM, Paravox.ai:
Java, AI, LLMs and Hardware Acceleration", performance of the Java
port being within 10% of the C versions, potential huge
opportunities for integrating AI and LLMs with enterprise Java
systems for use cases like fraud detection, the Java port being a
1,000 line self-contained implementation with no external
dependencies, the need for more resources and support to further
develop the Java LLM integration, the llama2.java project
Alfonso Peterssen on twitter: @TheMukel
Weitere Episoden
1 Stunde 12 Minuten
vor 7 Monaten
1 Stunde 6 Minuten
vor 7 Monaten
57 Minuten
vor 8 Monaten
1 Stunde 5 Minuten
vor 8 Monaten
1 Stunde 13 Minuten
vor 8 Monaten
In Podcasts werben
Kommentare (0)