<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:atom="http://www.w3.org/2005/Atom" version="2.0">
  <channel>
    <title>Bijay's Blog</title>
    <link>https://blog.regmi.dev</link>
    <description>Personal Blog about anything that occurs to me</description>
    <language>en</language>
    <lastBuildDate>Mon, 25 May 2026 12:57:14 +0000</lastBuildDate>
    <copyright>© 2026 Bijay's Blog</copyright>
    <atom:link rel="self" href="https://blog.regmi.dev/feed.xml" type="application/rss+xml"/>
    <item>
      <title>Understanding LLMs and Modern Inference Engines</title>
      <link>https://blog.regmi.dev/post/understanding-llms-and-modern-inference-engines</link>
      <description>Choosing an LLM inference engine is a hardware-and-systems decision, not a meme. For real self-hosting, runtime, throughput, concurrency, and cost matter as much as the model.</description>
      <guid>https://blog.regmi.dev/post/understanding-llms-and-modern-inference-engines</guid>
      <pubDate>Mon, 25 May 2026 12:57:14 +0000</pubDate>
      <author>bijay@regmi.dev</author>
      <dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">bijayregmi</dc:creator>
      <category>llms</category>
      <category>inference</category>
      <category>inference-engines</category>
      <category>vllm</category>
      <category>llama-cpp</category>
      <category>tensorrt-llm</category>
      <category>sglang</category>
      <category>self-hosting</category>
      <category>open-source-models</category>
      <category>gpu</category>
      <category>nvidia</category>
      <category>ai-infrastructure</category>
    </item>
    <item>
      <title>State of Naïve RAG vs Agentic RAG in 2026</title>
      <link>https://blog.regmi.dev/post/state-of-naive-rag-vs-agentic-rag-in-2026</link>
      <description>RAG is not dead. In 2026, agentic RAG often beats naïve RAG for accuracy and complex retrieval, but naïve RAG still wins for simple, fast, low-cost use cases.</description>
      <guid>https://blog.regmi.dev/post/state-of-naive-rag-vs-agentic-rag-in-2026</guid>
      <pubDate>Sun, 17 May 2026 09:59:59 +0000</pubDate>
      <author>bijay@regmi.dev</author>
      <dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">bijayregmi</dc:creator>
      <category>rag</category>
      <category>data_engineering</category>
      <category>ai_engineering</category>
      <category>en</category>
      <category>english</category>
    </item>
    <item>
      <title>Ein medizinisches Modell mit synthetischen Daten</title>
      <link>https://blog.regmi.dev/post/ein-medizinisches-modell-mit-synthetischen-daten</link>
      <description>Optimierte KI-Modelle für die medizinische Kodierung: Wie BERT und synthetische Daten den Klinikalltag revolutionieren.</description>
      <guid>https://blog.regmi.dev/post/ein-medizinisches-modell-mit-synthetischen-daten</guid>
      <pubDate>Sun, 17 May 2026 09:59:59 +0000</pubDate>
      <author>bijay@regmi.dev</author>
      <dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">bijayregmi</dc:creator>
      <category>medizin</category>
      <category>ki</category>
      <category>ai</category>
      <category>de</category>
      <category>german</category>
    </item>
  </channel>
</rss>