Coded Aperture Snapshot Spectral Imaging (CASSI)
CASSI captures a 3D hyperspectral data cube (2 spatial + 1 spectral dimension) in a single 2D camera exposure. The scene is modulated by a binary coded aperture mask, spectrally dispersed by a prism, and integrated onto a 2D detector. The forward model is y = H*x + n where H encodes both coded-aperture modulation and spectral-dispersion shift. Compression ratios equal the number of spectral bands (e.g. 28:1). Reconstruction exploits spectral correlation via GAP-TV, MST, or CST.
Coded Aperture Dispersion
Gaussian
mst
CMOS
Forward-Model Signal Chain
Each primitive represents a physical operation in the measurement process. Arrows show signal flow left to right.
M(mask) → W(α, a) → Σ_λ → D(g, η₄)
Benchmark Variants & Leaderboards
CASSI
Coded Aperture Snapshot Spectral Imaging
M(mask) → W(α, a) → Σ_λ → D(g, η₄)
Standard Leaderboard (Top 10)
| # | Method | Score | PSNR (dB) | SSIM | Trust | Source |
|---|---|---|---|---|---|---|
| 🥇 | MiJUN-5stg | 0.927 | 40.9 | 0.991 | ✓ Certified | Meng et al. AAAI 2025 |
| 🥈 | RDLUF-MixS2-9stg | 0.904 | 39.6 | 0.988 | ✓ Certified | Dong et al. CVPR 2023 |
| 🥉 | DAUHST-9stg | 0.883 | 38.4 | 0.985 | ✓ Certified | Cai et al. NeurIPS 2022 |
| 4 | PADUT-3stg | 0.854 | 36.95 | 0.975 | ✓ Certified | Li et al. ICCV 2023 |
| 5 | CST-L-Plus | 0.836 | 36.1 | 0.967 | ✓ Certified | Cai et al. ECCV 2022 |
| 6 | MST++ | 0.833 | 36.0 | 0.966 | ✓ Certified | Cai et al. CVPRW 2022 |
| 7 | MST-L | 0.809 | 34.81 | 0.958 | ✓ Certified | Cai et al. CVPR 2022 |
| 8 | HDNet | 0.804 | 34.66 | 0.952 | ✓ Certified | Hu et al. CVPR 2022 |
| 9 | SSR-L | 0.797 | 34.0 | 0.960 | ✓ Certified | Zhang et al. CVPR 2024 |
| 10 | DGSMP | 0.752 | 32.6 | 0.917 | ✓ Certified | Huang et al. CVPR 2021 |
Showing top 10 of 18 methods. View all →
Mismatch Parameters (8) click to expand
| Name | Symbol | Description | Nominal | Perturbed |
|---|---|---|---|---|
| mask_dx | Δx | Mask lateral shift (pixels) | 0 | 0.5 |
| mask_dy | Δy | Mask vertical shift (pixels) | 0 | 0.3 |
| mask_theta | θ | Mask rotation (rad) | 0 | 0.1 |
| disp_a1 | a₁ | Dispersion coefficient | 2.0 | 2.02 |
| disp_alpha | α | Dispersion angle (rad) | 0 | 0.15 |
| sigma_read | σ_r | Detector read noise std (electrons) | 5.0 | 8.0 |
| dark_current | I_d | Dark current (electrons/pixel/s) | 0.1 | 0.5 |
| gain | g | Detector gain multiplier | 1.0 | 1.03 |
Reconstruction Triad Diagnostics
The three diagnostic gates (G1, G2, G3) characterize how reconstruction quality degrades under different error sources. Each bar shows the relative attribution.
Model: coded aperture dispersion — Mismatch modes: mask misalignment, dispersion curve error, spectral response drift, defocus
Noise: gaussian — Typical SNR: 20.0–40.0 dB
Requires: coded aperture mask, dispersion curve, dark frame, spectral response
Modality Deep Dive
Principle
Coded Aperture Snapshot Spectral Imaging (CASSI) captures a full 3-D spectral datacube (x, y, λ) in a single 2-D snapshot by encoding the scene with a binary coded aperture and spectrally dispersing it with a prism onto the detector. Different spectral channels are shifted and superimposed on the sensor, creating a compressed measurement. Computational algorithms recover the full datacube from this single measurement using sparsity priors.
How to Build the System
Build an optical relay with an objective lens, place a binary coded aperture (lithographic chrome-on-glass mask or DMD) at an intermediate image plane, then disperse with an Amici or double-Amici prism, and re-image onto a high-resolution detector (2048× 2048+ pixels). Precisely calibrate the spectral dispersion curve (nm/pixel). The coded aperture pattern should have ~50 % transmittance and good conditioning.
Common Reconstruction Algorithms
- TwIST (Two-step Iterative Shrinkage/Thresholding)
- GAP-TV (Generalized Alternating Projection with Total Variation)
- ADMM with sparsity in DCT or wavelet domain
- Deep unfolding networks (DGSMP, TSA-Net, BIRNAT)
- Plug-and-Play ADMM with learned denoisers
Common Mistakes
- Poor spectral calibration causing wavelength assignment errors across the datacube
- Coded aperture not precisely at the image plane, blurring the code modulation
- Insufficient detector resolution relative to the number of spectral bands
- Ignoring optical aberrations in the dispersive relay that vary with wavelength
- Using a random mask without checking its sensing matrix condition number
How to Avoid Mistakes
- Calibrate spectral mapping with monochromatic sources at known wavelengths
- Mount coded aperture on a precision z-stage and focus to maximize modulation contrast
- Ensure detector pixel count > (spatial pixels × spectral bands) for adequate compression ratio
- Design the relay optics for uniform imaging quality across the spectral range
- Optimize or simulate the mask pattern for low coherence (good RIP) before fabrication
Forward-Model Mismatch Cases
- The widefield fallback produces a 2D (64,64) grayscale image, but CASSI compresses a 3D spectral datacube (64,64,L wavelengths) into a single 2D coded snapshot via a binary mask and dispersive prism — the spectral dimension is entirely absent
- Without the coded aperture mask and spectral dispersion, the measurement does not encode wavelength-dependent information — spectral unmixing or hyperspectral reconstruction from the fallback output is impossible
How to Correct the Mismatch
- Use the CASSI operator that applies the binary coded aperture mask followed by spectral dispersion (prism/grating shift), producing a 2D coded measurement that encodes the full 3D spectral datacube
- Reconstruct the (x,y,lambda) datacube using compressive sensing (TwIST, GAP-TV) or deep unfolding networks (TSA-Net, MST) that exploit the spatio-spectral structure encoded by the CASSI forward model
Experimental Setup
Custom SD-CASSI / KAIST CASSI prototype
binary random mask on photolithography substrate
Amici prism (SD-CASSI)
28
450-650
256x256
28
FLIR Grasshopper3 monochrome CMOS (2048x2048)
4f relay system with 1:1 magnification
Signal Chain Diagram
Key References
- Wagadarikar et al., 'Single disperser design for coded aperture snapshot spectral imaging', Applied Optics 47, B44-B51 (2008)
- Cai et al., 'Mask-guided Spectral-wise Transformer (MST++)', CVPRW 2022
Canonical Datasets
- CAVE (Columbia, 32 scenes, 512x512x31)
- KAIST (30 scenes, 2704x3376x28)
- ARAD_1K (1000 hyperspectral images)
Modality Maintainers
Abraham
platformai gpt1
Chengshuai Yang Yang