Projects

An efficient recurrent neural network (RNN) inference chip design for MIMO OFDM symbol detection

This design presents an energy efficient ANN accelerator RTL design which deploys MAC-tanh operations leveraging DSP48E1 IP in Virtex VC-707 FPGA for MIMO OFDM symbol detection

Project Overview

Implementation on VC-707 FPGA :

  • HDL Coding : VHDL + Verilog
  • Important IPs: Tri-mode-ethernet-MAC (TEMAC), UART, DSP48E2
  • LUT resource: 13314
  • FF count: 10750
  • BRAM count : 6
  • Static Power Consumption: 262 mW
  • Dynamic Power Consumption: 256 mw
View Project on GitHub

RTL Design of a Custom Graphics Processing Unit (GPU) with Frame Buffer and Pixel Controller for FPGA-Based Game Rendering

This project implements RTL SystemVerilog design of a custom graphics processor pipeline, with real-time VGA signal generation and spriteROM rendering through FSM-based control logic to create a turn based battle game with menu based move selection.

Project Overview

  • HDL coding: SystemVerilog
  • Protocol: VGA-ADC
  • Frequency -VGA clock: 25.175 MHz
  • Pixel resolution {R,G,B}: 24 bits
  • Logic resource (ALM): 214
  • Distributed FF count: 31
  • Total Block memory size (Bytes): 38,400
  • Total power estimate: 424.55 mw
View Project on GitHub

Neuromorphic SNN accelerator design with biologically inspired 'On-Chip' training for Edge-AI application

This design presents an on-chip spiking neural network (SNN) neuromorphic accelerator design deploying biologically inspired training for low power Edge-AI classification tasks.

Project Overview

Implementation on VC-707 FPGA :

  • HDL coding : SystemVerilog
  • LUT resource: 3488
  • FF count: 3029
  • Static Power Consumption: 108 mW
  • Dynamic Power Consumption: 216 mw
  • Maximum Operating Frequency: 118 MHz
View Project on GitHub

Hardware–software co-design: A complex polynomial series on-board solver design by PS-PL integration

This design utilizes the NIOS II soft-core CPU (PS) to compute complex Maclaurin series expansions, while the FPGA fabric (PL) handles real-time sample delivery and result capture for seamless on-board hardware-software integration.

Project Overview

  • PS CPU spec: NIOS II
  • FPGA PL fabric: Cyclone V
  • PS programming: C
  • PL HDL: SystemVerilog
  • Block memory size (Bytes): 31920
  • Logic resource (ALM): 1704
  • Distributed FF count: 2818
  • Total power estimate: 449.06 mw
View Project on GitHub

RTL Design & verification of 32 bit MIPS single cycle CPU for R and I type instructions

The project features a 32-bit MIPS CPU designed to execute instructions in a single clock cycle. It is built from scratch using Verilog. The processor is inspired by MIPS/RISC-V architecture principles and supports a subset of R-type and I-type instructions, consisting of core processor components and essential operations like instruction fetching, arithmetic computations, and memory access.

Project Overview

  • HDL programming: Verilog
  • Target device: Zynq-7000 ZC-702
  • Block RAM: 1.5
  • LUT count: 378
  • Distributed FF count: 94
  • Total power estimate: 114 mw
View Project on GitHub

RTL Implementation & FPGA-Proven Verification of a Direct-Mapped Cache-RAM System with FSM Control Logic

This project involves the RTL design and verification of a direct-mapped cache memory system integrated with RAM and a control unit, implemented using Xilinx Vivado. The system supports a 15-bit address input, managing a 128-bit cache line with 1024 entries.

Project Overview

  • HDL programming: Verilog
  • Block RAM: 1.5
  • LUT count: 5358
  • Distributed FF count: 4098
  • Total power estimate: 151 mw
View Project on GitHub