##openfpga on 2017-07-12 — irc logs at freenode.irclog.whitequark.org

00:46 azonenberg_work has joined ##openfpga

00:47 amclain has quit [Quit: Leaving]

01:02 azonenberg_work has quit [Ping timeout: 260 seconds]

01:54 * azonenberg is respinning the board tonight

02:20 _whitelogger has joined ##openfpga

02:32 digshadow has joined ##openfpga

04:02 m_w has quit [Quit: leaving]

04:26 _whitelogger has joined ##openfpga

04:46 laintree has quit [Ping timeout: 246 seconds]

04:53 digshadow has quit [Quit: Leaving.]

04:53 digshadow1 has joined ##openfpga

04:53 laintree has joined ##openfpga

04:54 laintree is now known as laintoo

05:23 SpaceCoaster has quit [Ping timeout: 240 seconds]

05:35 eduardo__ has joined ##openfpga

05:39 eduardo_ has quit [Ping timeout: 255 seconds]

05:44 SpaceCoaster has joined ##openfpga

05:50 mifune has joined ##openfpga

05:50 cyrozap has quit [Quit: Client quit]

05:51 cyrozap has joined ##openfpga

05:58 mifune has quit [Ping timeout: 260 seconds]

06:03 cyrozap has quit [Quit: Client quit]

06:03 cyrozap has joined ##openfpga

06:04 Hootch has joined ##openfpga

06:12 fpgacraft1 has quit [Quit: ZNC 1.7.x-git-709-1bb0199 - http://znc.in]

06:12 fpgacraft1 has joined ##openfpga

06:17 cyrozap has quit [Quit: Client quit]

06:18 cyrozap has joined ##openfpga

06:26 cyrozap has quit [Quit: Client quit]

06:27 cyrozap has joined ##openfpga

06:31 cyrozap has quit [Client Quit]

06:32 cyrozap has joined ##openfpga

06:39 balrog has quit [Ping timeout: 246 seconds]

06:51 balrog has joined ##openfpga

07:05 <azonenberg> welp, almost done redoing the gp4 thermal breakout...

07:13 scrts has quit [Ping timeout: 268 seconds]

07:16 scrts has joined ##openfpga

07:20 <openfpga-github> [openfpga] azonenberg pushed 1 new commit to master: https://git.io/vQyv6

07:20 <openfpga-github> openfpga/master d2cc6f5 Andrew Zonenberg: Updated gp4-stqfn-thermal with correct pinout for 20-pin header (PCB rev 0.2)

07:26 sgstair has quit [Quit: .•«UPP»•.]

07:29 <openfpga-github> [openfpga] azonenberg pushed 1 new commit to master: https://git.io/vQyfg

07:29 <openfpga-github> openfpga/master 0a23f0e Andrew Zonenberg: Updated schematic for v0.2 thermal board

07:37 cr1901_modern has quit [Quit: Leaving.]

07:39 sgstair has joined ##openfpga

07:43 scrts has quit [Ping timeout: 258 seconds]

07:47 scrts has joined ##openfpga

07:48 jn__ has quit [Ping timeout: 276 seconds]

07:54 jn__ has joined ##openfpga

10:14 scrts has quit [Ping timeout: 255 seconds]

10:17 scrts has joined ##openfpga

10:18 seu_ is now known as seu

12:28 <pie_> azonenberg, omg reqorkctf will be availible from oshpark? wooooo *3*

12:28 <pie_> azonenberg, any idea about rough price?

12:37 fpgacraft2_ has joined ##openfpga

12:41 uelen has quit [Quit: No Ping reply in 180 seconds.]

12:41 fpgacraft2 has quit [Ping timeout: 246 seconds]

12:41 fpgacraft2_ is now known as fpgacraft2

12:41 uelen has joined ##openfpga

12:48 clifford_ has joined ##openfpga

12:48 clifford has quit [Read error: Connection reset by peer]

13:03 Hootch has quit [Read error: Connection reset by peer]

13:35 scrts has quit [Ping timeout: 255 seconds]

13:38 scrts has joined ##openfpga

14:27 pie_ has quit [Remote host closed the connection]

14:29 pie_ has joined ##openfpga

14:29 pie__ has joined ##openfpga

14:30 pie__ has quit [Remote host closed the connection]

15:22 m_w has joined ##openfpga

15:58 amclain has joined ##openfpga

16:10 digshadow1 has quit [Ping timeout: 260 seconds]

16:12 cr1901_modern has joined ##openfpga

16:13 [X-Scale] has joined ##openfpga

16:15 m_w has quit [Quit: leaving]

16:16 mifune has joined ##openfpga

16:17 X-Scale has quit [Ping timeout: 255 seconds]

16:17 [X-Scale] is now known as X-Scale

16:18 m_w has joined ##openfpga

16:42 mifune_ has joined ##openfpga

16:44 pim__ has joined ##openfpga

16:45 mifune has quit [Ping timeout: 240 seconds]

16:48 mifune_ has quit [Ping timeout: 246 seconds]

16:53 Hootch has joined ##openfpga

17:05 digshadow has joined ##openfpga

17:06 pim__ has quit [Quit: Leaving]

17:06 mifune has joined ##openfpga

17:13 scrts has quit [Ping timeout: 240 seconds]

17:17 scrts has joined ##openfpga

17:45 abetusk has joined ##openfpga

17:51 <abetusk> sorry for the noob question, but openfpga is supposed to be a toolchain for going from VHDL/Verilog to binary files and to help with programming fpgas?

17:51 <azonenberg> abetusk: This channel is a general discussion forum for anything to do with open source software for targeting FPGAs

17:51 <azonenberg> If you have a question about how to use vhdl/verilog or a chip-specific feature that isn't toolchain dependent ##fpga is the better bet

17:52 <azonenberg> Anything related to using or developing open tools is on topic here

17:53 <azonenberg> We normally use Yosys (which has its own channel, #yosys) for synthesis, which is going from HDL source code to a gate-level circuit description with no placement information

17:53 <azonenberg> As of now Yosys only supports Verilog, rqou is working on a VHDL front end but it's far from usable

17:53 <azonenberg> Once you have the netlist from Yosys you can feed it into either Icestorm (for lattice ice40), gp4par (for silego greenpak4), or cr2par (for xilinx coolrunner-2)

17:54 <azonenberg> the latter two tools both live in the azonenberg/openfpga repository and are developed by folks here

17:54 <azonenberg> while talking about icestorm is on topic here, i think only one of the developers is in this channel

17:55 <azonenberg> abetusk: That answer your question?

17:56 <abetusk> azonenberg, yes, thank you. I keep flirting with the idea with getting into fpga programming but there's a big barrier to entry. I would rather invest in some FOSS toolchain if possible. That information is great, thanks again

17:57 <azonenberg> So as of now there's two main toolchains that i would consider in a usable state

17:57 <azonenberg> icestorm and gp4par

17:57 <azonenberg> cr2par is not yet well supported/tested enouhg for me to recommend you use unless you're trying to get involved as a toolchain developer

17:58 <azonenberg> icestorm works on smallish FPGAs that go up to i think 8000 LUTs? i havent used it myself, kept meaning to play with it

17:58 <azonenberg> gp4par works on the silego greenpak4 line, which are super tiny (up to 26 LUTs) but very cheap (0.3 USD each in volume), have internal OTP configuration memory, and have a bunch of analog/digital hard IP cores

17:59 <azonenberg> They're meant for glue logic type applications, io expansion, reset sequencing, power rail monitoring, etc

17:59 <azonenberg> Depending on what you want to do i'd recommend one or the other

17:59 <abetusk> do you have any recommendations for just general learning? The icestorm?

18:00 <azonenberg> oh, and cr2par targets xilinx coolrunner-2 which are kind of in between, they go up to 512 macrocells (but that one is quite expensive, 256 is the biggest i'd recommend playing with)

18:00 <azonenberg> they're an older xilinx family

18:00 <azonenberg> no fancy peripherals

18:00 <azonenberg> but way faster than greenpak

18:01 <azonenberg> I would say icestorm is probably best for general learning fpgas

18:01 <azonenberg> then if you have a simple project that fits in a greenpak you can probably save cash and pcb space using them

18:02 <abetusk> and in terms of verilog/vhdl libraries...is there an equivalent of a GitHub somewhere for those?

18:03 <azonenberg> Some people have HDL modules (generally referred to as "IP cores") on github

18:03 <azonenberg> there's also opencores, which in my experience is awful code quality

18:03 <azonenberg> I may be a bit odd in this regard but i do not recommend using 3rd party IP when starting out

18:03 <azonenberg> it's a great way to connect black boxes and not understand anything thats going on :p

18:04 <abetusk> ok

18:04 <azonenberg> Once you know enough to build the core yourself, if you want to use somebody else's code to save development/debug time you can do so

18:04 <azonenberg> but until then it will probably just be a big footgun

18:05 <azonenberg> Start out by building a simple uart or something

18:05 <abetusk> well, the hello world is usually a blinking light..

18:05 <azonenberg> Well yeah but i meant your first nontrivial project

18:06 <balrog> build a stopwatch that can count up and down

18:06 <balrog> :P

18:06 <abetusk> hehe, right.

18:06 <azonenberg> Lol i'd start with a uart

18:06 <abetusk> azonenberg, thanks again, this is great information.

18:06 <azonenberg> make something that echoes everything you type rot13'd or something

18:06 <balrog> then add reset and split

18:06 <azonenberg> balrog: that requires either a bunch of 7seg displays or a uart

18:07 <balrog> then add multiple split times where a button cycles through

18:07 <azonenberg> i dont think any ice40 devkits have that?

18:07 <lain> just display the raw binary counter on leds

18:07 <lain> :3

18:07 <balrog> azonenberg: yeah... some devkits have 7seg displays, but that's a good point, dunno if ice40 ones do

18:07 <azonenberg> except uart which is easier to start with

18:13 scrts has quit [Ping timeout: 276 seconds]

18:14 scrts has joined ##openfpga

18:16 <digshadow> azonenberg: this proprietary FSDB code is a POS

18:16 <digshadow> they print *everything* to stderr

18:16 <digshadow> main returns TRUE/FALSE

18:16 <digshadow> and other oddities

18:17 <digshadow> if (TRUE != RunShittyTest()) fprintf(stderr, "test passed\n");

18:17 <digshadow> and wtf is that

18:18 <digshadow> ugh

18:18 <azonenberg> wuuut?

18:18 <digshadow> their function names also start with __

18:19 <azonenberg> This is the flat file streaming database?

18:19 <digshadow> so it was actually

18:19 <azonenberg> or something else

18:19 <digshadow> if (TRUE != __RunShittyTest()) fprintf(stderr, "test passed\n");

18:19 <digshadow> Fast Signal Database

18:19 <digshadow> a proprietary version of vcd

18:20 <azonenberg> ooof

18:20 <azonenberg> is there a converter tool?

18:20 <digshadow> yeah it doesn't work

18:20 <digshadow> which is why I'm now trying to figure out if I can use the library directly

18:21 <digshadow> it gets confused on some of the types

18:31 <openfpga-github> [yosys] azonenberg pushed 3 new commits to master: https://git.io/vQSlb

18:31 <openfpga-github> yosys/master 4a8c131 Clifford Wolf: Fix the fixed handling of x-bits in EDIF back-end

18:31 <openfpga-github> yosys/master 10c7709 Clifford Wolf: Generate FSM-style testbenches in smtbmc

18:31 <openfpga-github> yosys/master 479be3c Clifford Wolf: Fix handling of x-bits in EDIF back-end

18:43 scrts has quit [Ping timeout: 255 seconds]

18:46 scrts has joined ##openfpga

19:13 scrts has quit [Ping timeout: 240 seconds]

19:19 <awygle> azonenberg: re: your massively parallel place-and-routing ambitions, do you have a particular objection to GPUs?

19:24 <azonenberg> awygle: yes, they are awful at branching

19:24 <azonenberg> plus one GPU still wouldn't be enough for the kind of stuff i'm talking about

19:26 <awygle> mk

19:26 <balrog> they're awful at divergence, not branching, so if you have large blocks that branch in the same way you might still benefit

19:27 <balrog> azonenberg: what are you trying to do?

19:27 scrts has joined ##openfpga

19:27 <balrog> cyrozap, pointfree: wanted to ask, how are psoc things going?

19:30 <azonenberg> balrog: Hypothesizing about creating a highly scalable P&R

19:30 <azonenberg> Dream is something that can PAR a full kintex ultrascale in tens of seconds to a few minutes on 1024+ x86 cores

19:31 <azonenberg> it's currently nothing more than a dream, i've done no real research into it

19:31 <azonenberg> other than finding casual similarities between molecular dynamics and PAR

19:31 <azonenberg> and knowing MD scales fairly well

19:31 <azonenberg> to 100k's of cores

19:43 scrts has quit [Ping timeout: 240 seconds]

19:44 scrts has joined ##openfpga

19:59 azonenberg_work has joined ##openfpga

20:04 Hootch has quit [Quit: Leaving]

21:11 wpwrak has quit [Read error: Connection reset by peer]

21:11 wpwrak has joined ##openfpga

21:22 <rqou> azonenberg: i found a paper about gpu-accelerated placement using simulated annealing

21:22 <rqou> the key insight was: you don't need to synchronize all the time and get exactly the same answer as what a cpu will get

21:23 <rqou> as long as your placements stay legal, you can produce some "slightly incorrect" answers

21:23 <rqou> and as long as you anneal enough it's still "good enough"

21:24 <cr1901_modern> why aren't "legal" and "slightly incorrect" mutually exclusive?

21:24 <rqou> e.g. if you accepted a swap when you shouldn't have (because it makes the score worse)

21:24 <rqou> but all sites are still valid and there aren't e.g. two cells in the same spot

21:24 <cr1901_modern> Ahhh, cool thanks

21:25 <balrog> rqou: link the paper please? :)

21:26 <rqou> hmm can't find it right now

21:27 <rqou> alright, here it is: https://pdfs.semanticscholar.org/39da/c0c9dbf3ac198df9e2b79fd51a9f7d938eb1.pdf

21:29 <rqou> alright, now i need to rush and finish up my slides for mtvre

21:51 <awygle> since azonenberg put the bug in my ear i've been amassing a reasonable collection of "fpga placement but faster/more parallel" papers (including that one)

21:51 <awygle> population annealing is very interesting

21:52 <rqou> azonenberg doesn't want to move in that direction though

21:52 <rqou> iirc he wanted to look into quadratic-wirelength algorithmms

21:52 <rqou> e.g. this one (paywall) http://ieeexplore.ieee.org/document/1515784/

21:53 <awygle> yeah there are some cool analytical options too

21:53 <rqou> are you good at algorithms? code some of these for us pl0x :P

21:53 <awygle> i have a paper on GPU-accelerated star+ that i want to dive into

21:53 <rqou> test with ice40 or something

21:53 m_w has quit [Quit: Leaving]

21:53 <awygle> i've been thinking about inventing a "fake" 80,000 LUT ice40 and swapping out the "place" part of arachne-pnr

21:54 <rqou> i personally would start with just a real ice40 8k

21:54 <awygle> i'd want to do that as well, but i figure parallelism speedups would be more obvious at larger sizes

21:54 <awygle> real 8k for correctness

21:57 mifune has quit [Ping timeout: 276 seconds]

21:58 <rqou> awygle: another thing to try would be to port the academic ice40 toolkit to work for ice40

21:59 <awygle> rqou: the "academic ice40 toolkit"? not familiar

21:59 <rqou> sorry, "the academic vpr toolkit"

21:59 <awygle> ahh

22:00 <awygle> yeah that would be useful because it's highly swappable as i understand it, making it potentially a better platform for experimentation

22:01 <awygle> but i think that requires deeper knowledge of the chip(s) than i currently have

22:01 <rqou> http://www.clifford.at/icestorm/logic_tile.html

22:01 <awygle> working on it :)

22:01 <rqou> although these docs are nice and confusing

22:22 <azonenberg> rqou: i saw the same paper

22:22 <rqou> which one? gpgpu or qpf?

22:23 <azonenberg> the gpgpu one

22:23 <lain> gpgpgpgpgpgpgpgpgpi

22:23 <lain> ... s/i/u/

22:23 <lain> I fucked it up

22:23 <azonenberg> But annealing still doesnt scale as well as i want, i want to go analytic

22:23 <azonenberg> And again i want to scale to looots of cores

22:23 <azonenberg> not one gpu

22:23 <azonenberg> more like a couple racks of ec2 spot instances or something

22:23 <rqou> just pretend each cpu is like a gpu alu :P

22:23 <rqou> we can even arrange them into "warps" :P :P :P

22:24 <rqou> (yeah i know, everyone hates warps)

22:24 <rqou> lain: i'm waiting for it to go full-circle so that someone has the "bright idea" to make a "special-purpose GPU" just for graphics

22:27 mifune has joined ##openfpga

22:27 <lain> hahahah

22:32 <awygle> azonenberg: you can get 16 K80s on one AWS instance now, and i think cluster those instances as well but i'm not too familiar with that

22:32 <awygle> that's 40,000+ cores per instance (if you believe the marketing which you shouldn't), plus you also get a big win for offline builds

22:34 <qu1j0t3> [if you have a budget that is]

22:34 kristianpaul has quit [Remote host closed the connection]

22:35 kristianpaul has joined ##openfpga

22:35 <awygle> $14.40 an hour. if you meet the 10sec target that's 4 cents per build, on an on-demand instance not a spot. i'm not super familiar with AWS pricing generally, would it really be cheaper on CPUs?

22:35 <awygle> i guess we're starting to make stuff up at this point

22:48 <rqou> plz 2 code us some algorithms kthx :P

23:06 mifune has quit [Ping timeout: 248 seconds]

23:16 wpwrak has quit [Read error: Connection reset by peer]

23:17 wpwrak has joined ##openfpga

23:19 <azonenberg> Lol we'll have to prototype :p

23:19 <azonenberg> i have no idea what the actual number of core-hours required

23:19 <rqou> step 0: get "that gsoc guy" to finish fixing up the ice40 docs

23:19 <rqou> so that someone else can actually understand them

23:19 <azonenberg> Lower bound, assuming linear scaling: fairly well packed 7a100t in ~1 hr -> 4 core-hours

23:20 <azonenberg> so on 128 cores that'd be 1 min 52 sec

23:20 <azonenberg> or 14 sec on 1024 cores

23:20 GenTooMan has joined ##openfpga

23:21 <azonenberg> This assumes no non-parallel setup/cleanup and no scaling overhead, as well as no optimizations that make our code cleaner than ISE's nightmare

23:21 <azonenberg> But it should give an OOM estimate of what the best plausible performance we could get under ideal conditions with a perfectly scaling algorithm would be

23:22 <rqou> hmm i wonder

23:22 <rqou> is ise's PAR completely generic?

23:22 <rqou> is _is_ very data-driven

23:22 <azonenberg> you're wondering about copying vivado data files into ise?

23:22 <azonenberg> it would probably be possible to *port* but i dont think its 100% generic

23:22 <rqou> i was thinking the other direction

23:22 <azonenberg> oh

23:23 <azonenberg> i dont know anything about vivado's guts

23:24 <lain> you don't want to know

23:25 <azonenberg> Lol i figured :p

23:30 <rqou> i've heard that it's obsessed with tcl

23:30 <azonenberg> Yes, but so is 90% of the modern EDA world

23:31 <rqou> btw, quality c code right here: https://github.com/rqou/laughing-waffle/blob/master/fpga/dump/main.c

23:32 <azonenberg> lol

23:32 <azonenberg> wut

23:32 <rqou> what? :P

23:33 <rqou> isn't this "vu16" nonsense the "classic" way to write embedded c garbage? :P

23:34 <rqou> anyways, time for me to shower and start heading over to mtvre

23:58 <azonenberg> Welp

23:58 <azonenberg> The plot thickens

23:59 <azonenberg> in greenpak devices it's possible for one net num to correspond to multiple logical HDL ports depending on device configuration

23:59 <azonenberg> Hmmm

23:59 <pie_> azonenberg, https://media.giphy.com/media/i2j51OF1D2t0c/200.gif

23:59 <azonenberg> lol