IAA-RAG brings state-of-the-art artificial intelligence to the astronautics community — instant answers, deep research and automated reports from the entire IAA document corpus. Powered by Google Gemini 2.0.
What We Offer
IAA-RAG redefines how researchers interact with scientific knowledge — complemented by Indico for seamless event and conference management.
Ask any question — IAA-RAG retrieves precise answers with citations from thousands of IAA documents, papers and video transcripts in seconds.
Indico powers IAA event registration, programs, materials and archives — the trusted platform used by CERN and top research institutions worldwide.
Institutional authentication, document and data access control, and compliance with international security standards.
Connected to the international IAA network, synchronized with iaaspace.org and providing access to global publications and resources.
Repository of IAA session papers, presentations and reports, indexed and searchable through the AI engine.
Containerized, redundant infrastructure capable of supporting large volumes of users and documents simultaneously.
IAA-RAG v2
A state-of-the-art Retrieval-Augmented Generation system engineered from the ground up at the Institute of Space Science — combining cutting-edge AI models, hybrid search, knowledge graphs and real-time document intelligence.
Technology Stack
Google's latest LLM for generation, vision analysis and cross-document reasoning. LaTeX, images and academic citations fully supported.
Semantic vector search (768-dimensional embeddings from gemini-embedding-001) combined with BM25 lexical search, followed by re-ranking that keeps the top 35 results for precise retrieval.
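To illustrate how a hybrid search of this kind can merge two result lists, here is a minimal sketch using reciprocal rank fusion (RRF). RRF is a common fusion technique chosen here for illustration; this page does not state which fusion or re-ranking method IAA-RAG uses. The function name, the constant `k=60` and the document IDs are assumptions.

```python
from collections import defaultdict

def reciprocal_rank_fusion(rankings, k=60, top_n=35):
    """Fuse several ranked lists of document IDs into one ranked list.

    Each document scores 1 / (k + rank) in every list where it appears,
    and the scores are summed. k=60 is a conventional default, not a
    value taken from IAA-RAG.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties are broken by ID for determinism.
    fused = sorted(scores, key=lambda doc_id: (-scores[doc_id], doc_id))
    return fused[:top_n]

# Hypothetical result lists from the vector index and the BM25 index.
vector_hits = ["doc_a", "doc_b", "doc_c"]
bm25_hits = ["doc_b", "doc_d", "doc_a"]
fused = reciprocal_rank_fusion([vector_hits, bm25_hits])
print(fused)
```

Documents that rank well in both lists rise to the top, so a paper matched both semantically and by keyword outranks one found by a single method.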
3,300+ entities and 1,600+ relations extracted from documents. Powers contradiction detection, cross-document analysis and agentic reasoning.
FastAPI streaming responses with Server-Sent Events. Answers appear token by token as they are generated. Parallel worker processing via Redis queues.
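As a rough sketch of what token-by-token streaming looks like on the wire, the generator below encodes tokens as Server-Sent Events frames. The JSON payload shape (`token`, `done`) and the `end` event name are assumptions for illustration, not the actual IAA-RAG protocol.

```python
import json

def sse_event(data, event=None):
    """Encode one Server-Sent Events frame.

    A frame is a set of 'field: value' lines ended by a blank line.
    JSON-encoding the payload keeps it on a single 'data:' line.
    """
    lines = []
    if event is not None:
        lines.append(f"event: {event}")
    lines.append(f"data: {json.dumps(data)}")
    return "\n".join(lines) + "\n\n"

def stream_tokens(tokens):
    """Yield one frame per token, then a final frame marking completion."""
    for token in tokens:
        yield sse_event({"token": token})
    yield sse_event({"done": True}, event="end")

frames = list(stream_tokens(["Hello", " world"]))
for frame in frames:
    print(frame, end="")
```

In a FastAPI application, a generator like this could be passed to `StreamingResponse` with `media_type="text/event-stream"`, so the browser receives each token as soon as it is produced.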
PDF, DOCX, PPTX, XLSX, images (AI vision analysis), YouTube transcripts, web scraping — all automatically chunked, embedded and indexed.
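The chunking step can be pictured with a simple word-based splitter that overlaps neighbouring chunks. This is a minimal sketch only: the chunk size, the overlap and the word-level splitting are assumptions, and the page does not describe the chunking strategy IAA-RAG actually applies.

```python
def chunk_text(text, chunk_size=200, overlap=40):
    """Split text into chunks of at most chunk_size words.

    Consecutive chunks share `overlap` words so that a sentence cut at a
    boundary still appears whole in at least one chunk.
    """
    if chunk_size <= 0 or not 0 <= overlap < chunk_size:
        raise ValueError("require chunk_size > 0 and 0 <= overlap < chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # the last chunk already reaches the end of the text
    return chunks

# Ten placeholder words, chunks of four words with one word of overlap.
sample = " ".join(f"w{i}" for i in range(10))
chunks = chunk_text(sample, chunk_size=4, overlap=1)
print(chunks)
```

Each chunk would then be embedded and indexed separately, which is what allows a search to return a specific passage rather than a whole document.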
Multi-step reasoning engine that decomposes complex questions into sub-queries, researches each independently, then synthesizes a comprehensive answer.
One click generates a full academic report — executive summary, structured sections, citations — exported as HTML ready for publication.
Fully containerized: FastAPI · PostgreSQL + pgvector · Qdrant · Neo4j · Redis · MinIO · Nginx · Prometheus/Grafana monitoring. 4 parallel worker replicas.
Every answer includes inline citations [1] linked to the exact page or timestamp in the source document. Authors, journal, year and page number are extracted automatically. Click a citation to jump directly to the source.
Ingest any YouTube video — transcripts are automatically extracted, chunked and indexed. Citations link to the exact timestamp in the video. Images in documents are analyzed with AI vision and included in search.
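A timestamped citation can be as simple as a standard YouTube watch URL carrying a `t` parameter. The sketch below shows one way to build such a link and a readable label; the helper names and the `VIDEO_ID` placeholder are illustrative assumptions, not part of the IAA-RAG codebase.

```python
from urllib.parse import urlencode

def youtube_timestamp_url(video_id, seconds):
    """Build a watch URL that starts playback at the given offset."""
    query = urlencode({"v": video_id, "t": f"{int(seconds)}s"})
    return f"https://www.youtube.com/watch?{query}"

def format_timestamp(seconds):
    """Render an offset in seconds as M:SS, or H:MM:SS past one hour."""
    hours, remainder = divmod(int(seconds), 3600)
    minutes, secs = divmod(remainder, 60)
    if hours:
        return f"{hours:d}:{minutes:02d}:{secs:02d}"
    return f"{minutes:d}:{secs:02d}"

# "VIDEO_ID" is a placeholder, not a real video.
url = youtube_timestamp_url("VIDEO_ID", 83)
label = format_timestamp(83)
print(f"[1] {label} {url}")
```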
The Knowledge Graph engine identifies when documents contradict each other on the same topic. This supports research quality assurance by comparing positions across years and authors automatically.
Full automation API: trigger document ingestion, query the RAG engine or generate reports from any external system via webhooks. Integrates with n8n, Zapier, Make and custom pipelines out of the box.
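As an illustration of how an external system might call such an API, the sketch below prepares a JSON POST request with the Python standard library. The host `rag.example.org`, the `/api/query` path, the `question` field and the bearer-token header are all hypothetical; the real endpoints and authentication scheme should be taken from the IAA-RAG API documentation.

```python
import json
import urllib.request

def build_query_request(base_url, question, api_key):
    """Prepare (but do not send) a JSON POST request to a query endpoint.

    The path and payload shape are assumptions made for this example.
    """
    body = json.dumps({"question": question}).encode("utf-8")
    return urllib.request.Request(
        url=f"{base_url.rstrip('/')}/api/query",
        data=body,
        headers={
            "Content-Type": "application/json",
            "Authorization": f"Bearer {api_key}",
        },
        method="POST",
    )

request = build_query_request(
    "https://rag.example.org/",  # hypothetical host
    "Which IAA papers discuss space debris mitigation?",
    "YOUR_API_KEY",
)
print(request.get_method(), request.full_url)
# Sending would be: urllib.request.urlopen(request), omitted here.
```

Automation tools such as n8n, Zapier or Make would issue an equivalent HTTP request from their own webhook or HTTP nodes.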
Platform Screenshots
Built at the Institute of Space Science · Powered by Google Gemini 2.0
Indico — developed by CERN and adopted by top research institutions worldwide — manages registrations, programs, materials and video recordings for all IAA Romania events. All conference documents are automatically ingested into IAA-RAG for AI-powered search.
Access
Access the IAA platforms with your institutional account. For a new account or technical support, contact the IAA Romania team.
Access issues? Contact contact@iaaspace.org