##openfpga on 2024-04-10 — irc logs at libera.irclog.whitequark.org

2021-06-23 14:13 azonenberg changed the topic of ##openfpga to: Open source tools for FPGAs, CPLDs, etc. Silicon RE, bitfile RE, synthesis, place-and-route, and JTAG are all on topic. Channel logs: https://libera.irclog.whitequark.org/~h~openfpga

00:29 Degi_ has joined ##openfpga

00:29 Degi has quit [Ping timeout: 260 seconds]

00:29 Degi_ is now known as Degi

00:29 cr1901_ has joined ##openfpga

00:33 cr1901 has quit [Ping timeout: 268 seconds]

00:46 <pie_> is there any consensus on whether xilinx or altera have "better" architectures?

00:47 <mwk> xilinx.

00:47 <mwk> fuck altera with a chainsaw.

00:48 <mwk> <whitequark> it should be a rusty chainsaw

01:23 <pie_> heh

01:23 <pie_> why?

01:28 <pie_> From the aforementioned review article, and also I guess justifying my question of if there have been any (major) changes, I guess this kind of explains FPGAs showing-up-ish in hpc;

01:28 <pie_> "More recently, FPGAs have been widely deployed in datacenters to accelerate various types of workloads such as search engines and network packet processing [9]. In addition, DL has emerged as a key component of many applications both in datacenter and edge workloads, with MAC being its core arithmetic operation. Driven by these new trends, the DSP block architecture has evolved in two different directions. The first direction targets the high-

01:28 <pie_> performance computing (HPC) domain by adding native support for single-precision floating-point (fp32) multiplication. Before that, FPGA vendors would supply designers with IP cores that implement floating-point arithmetic out of fixed-point DSPs and a considerable amount of soft logic resources. This created a huge barrier for FPGAs to compete with CPUs and GPUs (which have dedicated floating-point units) in the HPC domain. Native floating-point

01:28 <pie_> capabilities were first introduced in Intel’s Arria 10 architecture, with a key design goal of avoiding a large increase in DSP block area [79]. "

01:28 <pie_> "The second direction targets increasing the density of low-precision integer multiplication specifically for DL inference workloads."

01:28 <pie_> Though I still wonder about competing with GPUs or dedicated ML inference chips.

02:02 <pie_> yeesh;

02:02 <pie_> "For example, a single channel of high-bandwidth memory (HBM) has a 128-bit double data rate interface operating at 1 GHz, so a bandwidth-matched soft bus running at 250 MHz must be 1024 bits wide. With recent FPGAs incorporating up to 8 HBM channels [91] as well as numerous PCIe, Ethernet and other interfaces, system level interconnect can rapidly use a major fraction of the FPGA logic and routing resources. In addition, system-level interconnect

02:02 <pie_> tends to span large distances. The combination of very wide and physically long buses makes timing closure challenging and usually requires deep pipelining of the soft bus, further increasing its resource use. The system-level interconnect challenge is becoming more difficult in advanced process nodes, as the number and speed of FPGA external interfaces increases, and the metal wire parasitics (and thus interconnect delay) scales poorly [92]."

02:12 <pie_> Apparently leading to network-on-chips

02:12 <pie_> > Recent Xilinx (Versal) and Achronix (Speedster7t) FPGAs integrate a hard NoC [102], [103] similar to the academic proposals discussed above.

02:13 <sorear> I wonder if any of the "make it easier to do GPU and ML stuff" is actually "make it easier for GPU and ML companies to use our products for pre-silicon validation". there are enough ML accelerator startups that it might not be a small market

07:11 sgstair has quit [Ping timeout: 268 seconds]

08:37 sgstair has joined ##openfpga

11:29 schaeg has joined ##openfpga

12:23 schaeg has quit [Ping timeout: 255 seconds]

12:37 Flea86 has quit [Ping timeout: 255 seconds]

12:39 Flea86 has joined ##openfpga

13:42 mewt has quit [Read error: Connection reset by peer]

13:52 mewt has joined ##openfpga

13:56 <pie_> sorear: I was kind of wondering about that but I have no idea of the economics so I kind of discounted that as a viable option

13:57 <pie_> is it worth it to tape out an fpga due to hype cycle? at what point does it become worth it when asics are coming?

15:25 emeb_mac has joined ##openfpga

15:53 cr1901_ is now known as cr1901

19:44 <cr1901> pie_: Could you relink the article you're quoting?

19:57 <pie_> yeah one sec

19:58 <pie_> <pie_> afaict, good (introductory?) fpga review article on some basic internals of fpga architectures from 2021 from Andrew Boutros and Vaughn Betz (dunno if these are known names, at least the former seems to be part of VTR which is mentioned in this article and i think may come up here, or maybe thats VPR - well apparently vpr is part of vtr); https://www.eecg.utoronto.ca/~vaughn/papers/casm2021_arch_survey.pdf

20:06 <cr1901> tyvm

20:12 <pie_> \o/

20:46 <sorear> neat review article, would be nice if mentioned more companies

20:49 <pie_> i would ask if anyone else matters but they must because they stay afloat somehow

20:49 <pie_> and i suppose lattice is a given in this channel

20:49 <pie_> and I guess now that you mention it, i wonder if there is somethng on lattice?

20:51 <sorear> lattice, siliconblue, wasn't ice40 also alcatel/lucent at some point?

20:51 <sorear> then there's the actel^Wmicrosemi^Wmicrochip designs which replace all of the "SRAM cells" in the review article with flash transistors, could be important to mention

20:52 <mwk> ice40 was siliconblue, bought by lattice

20:52 <mwk> all the rest of lattice chips were at&t, alcatel/lucent, then lattice

20:52 <mwk> they're two technologically unrelated lineages

20:56 <cr1901> I read in Xilinx app notes that Spartan3E multipliers have configurable polarities for CLK and RST, but there's no parameter for the Verilog primitives to control them. Is this undocumented behavior, or Just Plain False?

21:00 <sorear> til about nanoxplore, trying to find information on the gr765 efpga

21:01 <pie_> mwk: oh. huh.

21:05 <pie_> someone get this database mirrored https://so-logic.net/en/knowledgebase/fpga_universe

21:06 <pie_> no idea if its any good, you haveto make an account and get approved to access it i think

21:06 <pie_> funny because i found it on some wikipedia article

21:07 <pie_> ah no actually I think Im remembering wrong, I dont think I actually tried to register.

21:16 * pie_ regresses to the mean

21:19 <mwk> cr1901: it's true

21:20 <mwk> and the way to access them is by just putting an inverter in front of the clock or reset

21:20 <mwk> ISE will merge them into the primitive

21:20 <mwk> (this is true in general for all invertible pins on Xilinx FPGAs)

21:20 <mwk> (though being sure exactly which one they are is ... tricky)

21:26 <cr1901> Oh FFS... it's the obvious thing to try, but it's not good practice at all, so I didn't try it