Elmus W.

Systems & Software Engineer

0%
Initializing
{
}
<
>
;
=
(
)
Portfolio

EFH Innovation Sprint: Multimodal RAG System

EFH Innovation Sprint - Team Project

Developed in a 5-person team at the Ecosystems, Finance & Health (EFH) Innovation Sprint hackathon (March 5-7, 2025, Nairobi). The pipeline uses multimodal RAG (text, tables, and images) to make complex scientific PDFs on climate-biodiversity-health linkages queryable and understandable. Built with LangChain, LangGraph concepts, unstructured.io for PDF parsing, vision-language models for image summarisation, and Together AI LLMs/embeddings. Won "Best Overall Code" award and placed 6th overall due to late submission.

PythonLangChainTogether AIunstructuredLlama-3-VisionLlama-3-70BpandasTorch

Impact & Results

  • Built a functional multimodal RAG pipeline in <3 days
  • Won "Best Overall Code" in competitive Kenya AI hackathon
  • Gained hands-on experience with modern RAG techniques, vision LLMs, and agentic patterns
  • Explored multiple LLM providers and embedding models

Architecture

  • PDF -> unstructured.io (hi-res) -> text/table/image extraction
  • Image summarisation with Llama-3.2-Vision
  • Text/table summarisation with Llama-3-70B
  • Multi-vector retriever with parent/child documents and summaries
  • In-memory vector store + byte store for fast prototyping

Challenges

  • Processing large scientific PDFs with mixed content (tables, figures)
  • Handling vision summarisation at scale during hackathon constraints
  • Balancing retrieval accuracy vs. response quality under time pressure
  • Late submission impacting final ranking despite strong code quality

Solutions

  • Used unstructured[all-docs] hi-res strategy for robust extraction
  • Parallel batch summarisation with concurrency limits
  • Multi-vector retriever pattern for better precision
  • Learned importance of early submission for future hackathons

Key Takeaways

Advanced RAG patterns: multi-vector, parent/child docs, image summarisation
Practical trade-offs when using vision-language models
Team collaboration under tight hackathon deadlines
Value of clean, well-structured code in competitive settings

Project Gallery

Browse through project illustrations