<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
  <channel>
    <title>Saheb&#39;s AI Journey</title>
    <link>https://saheb.github.io/blog/</link>
    <description>Recent content on Saheb&#39;s AI Journey</description>
    <generator>Hugo</generator>
    <language>en-us</language>
    <lastBuildDate>Mon, 30 Mar 2026 00:00:00 +0000</lastBuildDate>
    <atom:link href="https://saheb.github.io/blog/index.xml" rel="self" type="application/rss+xml" />
    <item>
      <title>The Day My AI Editor Went Silent: Debugging Git Worktrees, Phantom Configs, and Lost Weights</title>
      <link>https://saheb.github.io/blog/posts/git-worktree-unset/</link>
      <pubDate>Mon, 30 Mar 2026 00:00:00 +0000</pubDate>
      <guid>https://saheb.github.io/blog/posts/git-worktree-unset/</guid>
      <description>&lt;p&gt;Building reinforcement learning environments is already a complex exercise in managing state, rewards, and training loops. You expect the agent you are training to act unpredictably. You don&amp;rsquo;t expect the AI agent writing your code to completely flatline.&lt;/p&gt;&#xA;&lt;p&gt;While working on some recent RL projects, my AI code editor—Google Antigravity—just gave up. Every time I hit send, it would emit a single beep. No error message. No spinning loader. Just a beep and a dead UI.&lt;/p&gt;</description>
    </item>
    <item>
      <title>Reinforcement Learning with Snake (1)</title>
      <link>https://saheb.github.io/blog/posts/reinforcement-learning-snake-1/</link>
      <pubDate>Sun, 01 Mar 2026 00:00:00 +0000</pubDate>
      <guid>https://saheb.github.io/blog/posts/reinforcement-learning-snake-1/</guid>
      <description>&lt;p&gt;Reinforcement learning is interesting because it is intuitive and it aligns with how we learn naturally. You touch hot water, you feel the burn, you don&amp;rsquo;t touch it again. You learn by trial and error; you learn from experience. By interacting with the world, you get feedback in the form of reward or punishment, and that defines your behaviour.&lt;/p&gt;&#xA;&lt;p&gt;You notice there is a knob next to the water tap. It has a red arrow and a blue arrow on it. You press it — nothing happens. You pull it — nothing happens. You turn it and voila, the water starts again. You dare to touch it again, and this time it&amp;rsquo;s slightly less hot … and that&amp;rsquo;s how you learn by exploring. Either you can do this yourself, or someone teaches you and robs you of the opportunity of figuring it out on your own.&lt;/p&gt;</description>
    </item>
  </channel>
</rss>
