Innova-2 Flex for AI?

mwrnd · June 7, 2023, 5:44pm

requires more than 64GB of RAM … no FPGA board

The Alveo U200 has 64GB of DDR4 and DPUCADF8H supports it. Also, AWS F1 Instances.

I think I should use the RAM and CPU on the host and
interact with Gemmini on FPGA through the RoCC interface

When you tested the demo project you got complete transfer times on the order of 100,000 nanoseconds.

** Avg time device /dev/xdma0_c2h_0, total time 163964 nsec,
** Avg time device /dev/xdma0_c2h_0, total time 107604 nsec,
** Avg time device /dev/xdma0_h2c_0, total time 118067 nsec,

This is due to the latency of PCIe and software/driver overhead. PCIe bandwidth is high but latencies are not great. DDR4 has complete transfer times on the order of 100ns. Your system will be very slow.

I have successfully built Gemmini hardware using this command:
cd chipyard/generators/gemmini
./scripts/build-verilator.sh

You built a Gemmini system for the Verilator simulator. This is a much better idea than trying to use the MNV303611A-EDLT. Simulate your software running on RISC-V+Gemmini. Simulation should be the first step in hardware design.

Topic		Replies	Views
Failed to burn an image onto Innova-2 Flex FPGA SoC And SmartNIC	15	1814	February 11, 2024
Benchmarking GPUDirect RDMA on Modern Server Platforms Technical Blog	40	2686	April 11, 2019
CUDA 4.0 CUDA Programming and Performance	63	507394	March 28, 2013
Real-time GPU processing Peer 2 peer data copy, Linux kernel memory, kernels in kernel, CUDA Programming and Performance	35	8088	June 30, 2010
LLaMa 2 LLMs w/ NVIDIA Jetson and textgeneration-web-ui Jetson Projects generative_ai	86	23556	May 10, 2024
What's new in Maxwell 'sm_52' (GTX 9xx) ? CUDA Programming and Performance	69	26910	December 23, 2014
Fermi? Sounds interesting... CUDA Programming and Performance	58	15507	October 18, 2009
Innova 2 FPGA PCIE Rescan Not Showing Up SoC And SmartNIC	7	868	November 2, 2023
From NIC to GPU. CUDA Programming and Performance	40	13555	February 12, 2011
Pipeline operator forwarding for integer instructions in CUDA CUDA Programming and Performance cuda , kernel	25	261	July 15, 2024

Innova-2 Flex for AI?

Related topics