Project Proposal

Last updated January 23rd, 2026

Summary

Space Invaders is a classic fixed-shooter game in which the player controls a starfighter, shooting at and dodging enemies that descend from above. Rather than treating this as a one-off application, our project uses Space Invaders as a controlled testbed for comparing and understanding reinforcement learning (RL) methods.

To make broad comparisons feasible, we will primarily use MinAtar SpaceInvaders through Gymnasium. MinAtar provides a simplified 10×10, channel-based observation space on which agents train much faster than on full Atari, enabling us to run multiple seeds, hyperparameter sweeps, and component ablations within our compute budget. If time permits, we will validate key findings on the full ALE Space Invaders environment.
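As a rough illustration of what a seeded rollout on a Gymnasium-style environment could look like, the sketch below evaluates a uniform-random policy. The function name `evaluate_random_policy` and its parameters are hypothetical. It assumes only the standard Gymnasium interface (`reset(seed=...)` returning `(obs, info)`, `step(action)` returning a 5-tuple, and a discrete action space exposing `n`).

```python
import random
from statistics import mean


def evaluate_random_policy(env, episodes=10, seed=0):
    """Return the mean undiscounted episode return of a uniform-random policy.

    Assumption: `env` follows the Gymnasium API, i.e. reset(seed=...) returns
    (obs, info) and step(action) returns
    (obs, reward, terminated, truncated, info), with a discrete action space.
    """
    rng = random.Random(seed)  # separate RNG for action sampling
    n_actions = int(env.action_space.n)
    returns = []
    for episode in range(episodes):
        # Use a distinct, reproducible environment seed for each episode.
        env.reset(seed=seed + episode)
        done, total = False, 0.0
        while not done:
            action = rng.randrange(n_actions)
            _, reward, terminated, truncated, _ = env.step(action)
            total += float(reward)
            done = terminated or truncated
        returns.append(total)
    return mean(returns)
```

With MinAtar installed, the environment would typically be created through `gymnasium.make` using a MinAtar SpaceInvaders ID. The exact ID string depends on the installed MinAtar version, so it is left out here.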

Project Goals

🎯 Minimum Goal (Baseline + Harness)

🚀 Realistic Goal (Battery of Methods)

🌙 Moonshot Goal (Ablations + Deeper Insights)

Methods to Compare

Baselines

Value-Based Methods

Policy-Gradient Method

Model-Based (If Time Permits)

Practical Method-Based Comparison Menu (Planned)

To keep the project method-driven and feasible, we plan to prioritize the following menu of methods/components (in roughly this order), adding items as our implementation bandwidth allows:

1) Random policy (naïve baseline)
2) DQN (value-based baseline)
3) Double DQN (overestimation reduction)
4) Rainbow-style components (run as a full Rainbow agent, then ablate components):

This menu is designed to support clear ablation questions (e.g., “turn off prioritized replay—how much does sample efficiency or stability change?”) and to compare major algorithm families (value-based vs policy-gradient) under a consistent Gymnasium/MinAtar protocol.
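One possible way to organize such ablations is a configuration object with one switch per component, from which leave-one-out variants are generated. The sketch below is illustrative only. The class name, field names, and the `leave_one_out` helper are hypothetical, and the six switches follow the components of the original Rainbow agent, which may differ from the set this project ends up using.

```python
from dataclasses import dataclass, fields, replace


@dataclass(frozen=True)
class RainbowConfig:
    # Hypothetical component switches (assumption: the six components of the
    # original Rainbow agent); all enabled corresponds to the full agent.
    double_q: bool = True
    prioritized_replay: bool = True
    dueling: bool = True
    multi_step: bool = True
    distributional: bool = True
    noisy_nets: bool = True


def leave_one_out(full):
    """Map each component name to a config with only that component disabled."""
    return {f.name: replace(full, **{f.name: False}) for f in fields(full)}


variants = leave_one_out(RainbowConfig())
# variants["prioritized_replay"] is the full agent minus prioritized replay.
```

Training each variant under the same seeds and budget as the full agent would then isolate the contribution of a single component.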

Evaluation Plan

We will evaluate methods using a consistent, reproducible protocol.
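As one example of how per-seed results might be summarized reproducibly, the sketch below computes the mean score across seeds together with a percentile-bootstrap confidence interval. This is an assumption about a possible reporting choice, not the protocol this proposal commits to. The function name and its defaults are hypothetical.

```python
import random
from statistics import mean


def bootstrap_ci(values, n_resamples=2000, alpha=0.05, seed=0):
    """Return (mean, low, high): the sample mean and a percentile-bootstrap
    confidence interval for the mean of per-seed scores."""
    rng = random.Random(seed)  # fixed seed so the interval is repeatable
    n = len(values)
    # Resample the per-seed scores with replacement and record each mean.
    means = sorted(mean(rng.choices(values, k=n)) for _ in range(n_resamples))
    low = means[int((alpha / 2) * n_resamples)]
    high = means[int((1 - alpha / 2) * n_resamples) - 1]
    return mean(values), low, high
```

Called on a list holding one final score per seed, this returns the point estimate and an approximate 95% interval. With only a handful of seeds the interval is coarse and should be read with caution.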

Metrics

Experimental Design

Ablations (Core to the Method-Based Goal)

For Rainbow-style components, we will explicitly answer:

We will present ablation results as:

Failure Mode Analysis

Beyond metrics, we will document failure modes with concrete artifacts:

Tools & Resources

Team Meetings

📅 January 22nd, 2026 @ 1:00 PM PST
Zoom Meeting Link