#yosys on 2018-06-30 — irc logs at freenode.irclog.whitequark.org

2017-10-15 10:00 clifford changed the topic of #yosys to: Yosys Open SYnthesis Suite: http://www.clifford.at/yosys/ -- Channel Logs: https://irclog.whitequark.org/yosys

00:10 digshadow has quit [Ping timeout: 240 seconds]

00:21 promach_ has joined #yosys

00:41 digshadow has joined #yosys

00:51 dxld has quit [Quit: Bye]

00:52 dxld has joined #yosys

01:39 m_w has quit [Quit: Leaving]

02:00 <promach_> $global_clock is the same as smt_clk , right ?

02:01 <promach_> smt_clock

02:16 promach_ has quit [Quit: WeeChat 2.1]

02:16 promach_ has joined #yosys

02:55 <ZipCPU> emeb_mac: It's pipelined. That part isn't configurable. However, you can configure the size of the FFT, the number of bits in the input, the number of bits in the output, the number of multiplies used, whether or not the FFT is to accept two samples per clock, 1 sample per clock, 1 sample every two clocks, or 1 sample every three clocks.

02:55 <ZipCPU> Beyond that, the FFT is limited by your hardware ...

02:56 <ZipCPU> and by the fact that the updates I'm working through aren't (yet) working. So ... without the new updates that I'm working on, the FFT only does 2 samples per clock plus the other configurables.

03:06 <emeb_mac> ZipCPU: sounds pretty useful.

03:07 <ZipCPU> The 1 clock per sample and the 2 clocks per sample just passed my test at 2048 points! Yaaay ... (3 clocks per sample still fails)

03:07 <ZipCPU> Oh, and thanks!

03:08 <emeb_mac> the radio I'm working on now needs two 1024-pt 16-bit in/out transforms for the RX and a 2048 16 in/out on the TX.

03:08 <emeb_mac> we're using the Xilinx IP core for these and we've been pretty luck that they are working well

03:08 <emeb_mac> (other IP cores from Xilinx have turned out to be disasters)

03:11 <ZipCPU> ;) Yeah, the Xilinx cores seem to be reliable enough.

03:11 <ZipCPU> Once mine start working, they'll still need some tuning and optimization to match what Xilinx has done ... if I can do it at all.

03:12 <emeb_mac> ZipCPU: that's strong competition - they've got good performance and well optimized resource usage.

03:13 <ZipCPU> Exactly. Like I said, I don't know if I'll make a good showing in the end there, but at least what I have will work.

03:13 <emeb_mac> IIRC the 1kpt transforms we use require only 18 of the MAC cores

03:15 <ZipCPU> Let's see ... in the one I'm doing, you can tell it how many DSP cores you have. If you go full bore, 1 sample per clock, you'll need (10-2)*3 = 24 multiplies.

03:15 <ZipCPU> You could do it for less, at the cost of more LUT's.

03:16 <ZipCPU> On the other hand, if you want 2 clocks per sample, it would take (10-2)*2 = 16 multiplies, or if you want 3 clocks per sample it will take 8 multiplies.

03:16 <emeb_mac> That's not bad

03:16 <ZipCPU> On the other hand, if you are doing two samples per clock, then you'll want (10-2)*6 = 48 multiplies. It's all a tradeoff.

03:17 <ZipCPU> The soft multiply option isn't all that efficient though. I've got a slower option that's more efficient still, and I've thought of integrating that one in later.

03:18 <ZipCPU> The current soft multiply used is fully pipelined, so ... it requires a lot of flip flops and luts at every stage of the multiply.

03:20 <emeb_mac> Nice to have the option for soft multiplies

03:20 <ZipCPU> Yep!

03:20 <promach_> ZipCPU: which soft multiply are you referring to ?

03:21 <ZipCPU> What do you mean?

03:21 <promach_> you have your own multiply algo ?

03:21 <ZipCPU> Yes.

03:21 <emeb_mac> I try to avoid FPGAs w/o some hard multiplier resources for DSP stuff, but sometimes you gotta go with what's available

03:21 <promach_> wallace, Is uppose ?

03:21 <ZipCPU> It's a basic shift/add multiply, nothing fancy in this case.

03:21 <promach_> ok

03:22 <ZipCPU> I've built a wallace before, but ... the FFT doesn't use it.

03:27 <ZipCPU> Ok, 1, 2, and 3 clocks per sample now works using hardware multiplies, time to double check the soft multiplies

03:29 <emeb_mac> so does 1clk/sample allow continuous feed w/o any gaps?

03:30 <ZipCPU> Yes.

03:30 <ZipCPU> You can also feed it with unpredictable gaps too.

03:30 <emeb_mac> roughly what latency do you see from input to output?

03:31 <ZipCPU> Depends on the size of the FFT. Curious about a 1k FFT? I can go measure that.

03:31 <emeb_mac> yeah!

03:34 <ZipCPU> Looks like about 4176 clocks from the start of the first frame going in to the start of the first output frame.

03:34 AlexDaniel has quit [Read error: Connection reset by peer]

03:34 seldridge has quit [Ping timeout: 256 seconds]

03:34 AlexDaniel has joined #yosys

03:34 <ZipCPU> There's probably a couple clocks in there I could whittle out if latency was an issue, but that's what it is currently.

03:35 <emeb_mac> That's not bad.

03:35 <ZipCPU> Are you looking for low latency?

03:36 <emeb_mac> Generally yes - these radio designs tend to have fairly long datapaths with lots of things going on in them.

03:39 <ZipCPU> I'm not quite sure how I would, or if I would, redesign things for lower latency.

03:40 <emeb_mac> IIRC the cores we use have about 3k clocks latency. I don't think 4k would be a huge disadvantage tho

03:40 <ZipCPU> Hmm ... not sure where I'd find a full 1k latency from this design ....

03:41 <ZipCPU> Sure, there's a clock or two in each stage, but at ten stages that'd be at most 20 clocks.

03:41 <emeb_mac> Well, you're ahead of me. I've never thought too much about how to build an FFT.

03:43 <emeb_mac> about 20 years ago a guy I shared an office with architected one as a single-chip ASIC so I've only had peripheral exposure to it from discussing w/ him.

03:43 <ZipCPU> :)

03:43 <ZipCPU> I suppose I might go faster if I did something other than a Radix two FFT ...

03:43 * ZipCPU tugs at his beard

03:44 <emeb_mac> Aha - that must be it. Radix-4 was part of the optimization he did on his.

03:46 <ZipCPU> I might have to look into that in the future.

03:46 <ZipCPU> For now, I just want to get it running in the first place.

03:46 <ZipCPU> I'm pretty close, but ... not all cases work (yet)

04:03 ar3itrary has quit [Ping timeout: 276 seconds]

04:08 ar3itrary has joined #yosys

04:17 AlexDaniel has quit [Read error: Connection reset by peer]

04:17 AlexDaniel has joined #yosys

04:24 ar3itrary has quit [Ping timeout: 245 seconds]

04:31 ar3itrary has joined #yosys

04:48 <cr1901_modern> FFT was one of those things where I had to derive "how it works" exactly once and now I don't remember how to do it :(. I know you can split into bins by time or frequency (either works), but Idk if any way is better

05:26 emeb_mac has quit [Ping timeout: 265 seconds]

06:20 xerpi has joined #yosys

06:22 marbler has quit [Ping timeout: 240 seconds]

06:22 jfng has quit [Ping timeout: 240 seconds]

06:23 samayra has quit [Ping timeout: 245 seconds]

06:23 indefini has quit [Ping timeout: 245 seconds]

06:23 nrossi has quit [Ping timeout: 240 seconds]

06:23 lok[m] has quit [Ping timeout: 240 seconds]

06:23 swick has quit [Ping timeout: 240 seconds]

06:23 pointfree1 has quit [Ping timeout: 255 seconds]

06:23 Guest18568 has quit [Ping timeout: 256 seconds]

06:24 fevv8[m] has quit [Ping timeout: 276 seconds]

06:24 weebull[m] has quit [Ping timeout: 260 seconds]

06:26 cr1901_modern1 has joined #yosys

06:29 cr1901_modern1 has quit [Client Quit]

06:29 cr1901_modern has quit [Ping timeout: 245 seconds]

06:29 cr1901_modern1 has joined #yosys

06:29 cr1901_modern1 has quit [Client Quit]

06:30 cr1901_modern has joined #yosys

06:30 promach_ has quit [Ping timeout: 240 seconds]

06:31 promach_ has joined #yosys

06:38 cr1901_modern has quit [Read error: Connection timed out]

06:39 cr1901_modern has joined #yosys

07:36 samayra has joined #yosys

07:38 promach_ has quit [Ping timeout: 255 seconds]

08:20 Guest16831 has joined #yosys

08:20 lok[m] has joined #yosys

08:20 indefini has joined #yosys

08:20 nrossi has joined #yosys

08:20 marbler has joined #yosys

08:20 swick has joined #yosys

08:20 jfng has joined #yosys

08:20 fevv8[m] has joined #yosys

08:20 pointfree1 has joined #yosys

08:20 weebull[m] has joined #yosys

08:29 indy has quit [Ping timeout: 240 seconds]

08:40 promach_ has joined #yosys

09:51 pie_ has quit [Ping timeout: 260 seconds]

10:33 dys has joined #yosys

10:41 indy has joined #yosys

12:45 m_t has joined #yosys

14:55 emeb_mac has joined #yosys

14:57 <emeb_mac> ZipCPU: You've spoken before about the difficulty of applying formal to multipliers. Would it be a safe assumption that formal is generally not practical for DSP datapaths which rely heavily on math operations like multiplication / division / transformation?

14:57 <ZipCPU> Yes and no .... there are some ways around the problems.

14:58 <emeb_mac> I get the impression that the best way to apply formal in these types of designs is to partition complex control logic out and apply formal at the unit level.

14:58 <ZipCPU> I've had mixed success with data paths including multiplication or division.

14:58 * ZipCPU rummages through his designs for an example ....

14:59 <ZipCPU> Here's an example of an FIR filter (multiplies and all) where I manage to formally verify that the impulse response is correct using an abstract multiply: https://github.com/ZipCPU/dspfilters/blob/master/rtl/fastfir.v

15:00 <tpb> Title: dspfilters/fastfir.v at master · ZipCPU/dspfilters · GitHub (at github.com)

15:00 <ZipCPU> You might find the abstract multiply a fascinating read in and of itself: https://github.com/ZipCPU/dspfilters/blob/master/bench/formal/abs_mpy.v

15:00 <tpb> Title: dspfilters/abs_mpy.v at master · ZipCPU/dspfilters · GitHub (at github.com)

15:02 <emeb_mac> Interesting.

15:04 <emeb_mac> Would you call that exercise difficult? I have very little basis for comparison, but it seems somewhat contorted compared to simply running a stimulus / response simulation. Does it provide you with significantly more confidence in the design than a simpler approach?

15:15 <ZipCPU> Not sure.

15:15 <ZipCPU> Let's just say that, in this example, the jury is still out.

15:16 <ZipCPU> Consider this, I'm working with a perfect example right now ... I have 6 types of code for a butterfly. Three use DSP elements, three do not.

15:16 <ZipCPU> The three that use DSP elements work, the three that do not ... don't.

15:17 <ZipCPU> I'm trying to find out why.

15:17 <ZipCPU> If I try to apply formal methods to those other three right now, the formal methods don't complete. The multiply is just too difficult for them.

15:17 <ZipCPU> Even when I bring it down to a three bit multiply they are struggling.

15:19 <ZipCPU> For example, one of those soft multiply-based butterflies has now run its formal proof for over 12 hours, and has only made it to state 14 of 30.

15:21 <ZipCPU> On the other hand, the two butterflies that didn't require hardware multiplies could be formally verified quite quickly.

15:28 <awygle> iceradio continues to look awesome, btw

15:32 <ZipCPU> http://www.iceradio.ca ?

15:38 <promach_> Can https://github.com/ZipCPU/dspfilters/blob/master/bench/formal/abs_mpy.v actually multiply ? why "abstract" ?

15:38 <tpb> Title: dspfilters/abs_mpy.v at master · ZipCPU/dspfilters · GitHub (at github.com)

15:39 <ZipCPU> It's abstract because it's not really a multiply, but yet it still maintains many of the properties of a multiply.

15:42 <ZipCPU> The idea behind abstraction is that if (AB)->C and I can prove that A->C irrespective of B, then I've proved AB->C as well.

15:42 <ZipCPU> It's useful in those cases where B is really hard to express or work with.

15:50 <emeb_mac> awygle: Thanks! I haven't done much with it for the last year due to $DAYJOB getting in the way, but I've got plans for more features.

15:51 <emeb_mac> ZipCPU: Thanks for the insight.

15:55 m_t has quit [Quit: Leaving]

15:57 dxld has quit [Quit: Bye]

15:58 dxld has joined #yosys

16:07 dxld has quit [Quit: Bye]

16:10 dxld has joined #yosys

16:11 luismarques has joined #yosys

16:12 <awygle> The biggest upgrade to the iceradio for my purposes would be a lower power adc. I can't afford 73mW :-(

16:12 <awygle> But the DSP is the hard part from my perspective, I can always replace the afe if I decide to do something with it

16:14 <awygle> Oh it's actually much more. Idk what adc I was looking at lol

16:21 emeb_mac has quit [Quit: Leaving.]

17:34 luismarques has quit [Quit: luismarques]

17:34 luismarques has joined #yosys

17:36 promach_ has quit [Ping timeout: 248 seconds]

17:42 luismarques has quit [Ping timeout: 255 seconds]

17:43 luismarques has joined #yosys

17:44 dxld has quit [Quit: Bye]

17:45 dxld has joined #yosys

17:57 xerpi has quit [Quit: Leaving]

17:59 luismarques has quit [Ping timeout: 256 seconds]

18:05 <cr1901_modern> ZipCPU: I think awygle means this: http://ebrombaugh.studionebula.com/radio/iceRadio/index.html

18:05 <tpb> Title: iceRadio (at ebrombaugh.studionebula.com)

18:05 <cr1901_modern> it does look cool. Idk what I could do w/ it tho

18:05 <awygle> cr1901_modern is correct

18:06 <ZipCPU> Thanks, that makes a lot more sense than the other.

18:07 <cr1901_modern> I've had my ham radio license since the end of 2013; I've made like 4 or so contacts b/c I don't like voice all that much, and there's little to no digital activity

18:12 luismarques has joined #yosys

18:16 luismarques has quit [Ping timeout: 245 seconds]

18:46 luismarques has joined #yosys

19:00 pie_ has joined #yosys

19:07 luismarques has quit [Ping timeout: 255 seconds]

19:11 proteus-guy has quit [Ping timeout: 256 seconds]

19:22 X-Scale has joined #yosys

19:24 proteus-guy has joined #yosys

19:32 luismarques has joined #yosys

19:38 luismarques has quit [Ping timeout: 256 seconds]

19:42 luismarques has joined #yosys

19:53 digshadow has quit [Ping timeout: 248 seconds]

19:59 luismarques has quit [Ping timeout: 264 seconds]

20:01 m_w has joined #yosys

20:27 luismarques has joined #yosys

20:32 luismarques has quit [Ping timeout: 276 seconds]

20:35 sklv has quit [Remote host closed the connection]

20:36 sklv has joined #yosys

20:39 luismarques has joined #yosys

20:42 sklv has quit [Quit: quit]

20:43 luismarques has quit [Ping timeout: 245 seconds]

20:51 luismarques has joined #yosys

21:00 [X-Scale] has joined #yosys

21:01 X-Scale has quit [Ping timeout: 268 seconds]

21:01 [X-Scale] is now known as X-Scale

21:08 luismarques has quit [Ping timeout: 264 seconds]

21:15 emeb_mac has joined #yosys

21:23 sklv has joined #yosys

21:25 digshadow has joined #yosys

21:29 luismarques has joined #yosys

21:38 luismarques has quit [Ping timeout: 248 seconds]

21:41 dys has quit [Ping timeout: 240 seconds]

21:46 luismarques has joined #yosys

21:53 luismarques has quit [Ping timeout: 256 seconds]

21:59 luismarques has joined #yosys

22:07 luismarques has quit [Ping timeout: 256 seconds]

22:07 luismarques has joined #yosys

22:20 luismarques has quit [Ping timeout: 240 seconds]

22:25 luismarques has joined #yosys

22:30 luismarques has quit [Ping timeout: 248 seconds]

22:30 m_w has quit [Ping timeout: 276 seconds]

22:34 luismarques has joined #yosys

22:34 ar3itrary has quit [Ping timeout: 245 seconds]

22:34 digshadow has quit [Quit: Leaving.]

22:38 luismarques has quit [Ping timeout: 240 seconds]

22:39 m_w has joined #yosys

22:44 luismarques has joined #yosys

23:00 X-Scale has quit [Ping timeout: 260 seconds]

23:05 luismarques has quit [Ping timeout: 240 seconds]

23:07 X-Scale has joined #yosys

23:07 seldridge has joined #yosys

23:10 luismarques has joined #yosys

23:18 luismarques has quit [Ping timeout: 245 seconds]

23:19 luismarques has joined #yosys

23:36 luismarques has quit [Ping timeout: 260 seconds]

23:37 luismarques has joined #yosys

23:41 promach_ has joined #yosys

23:46 luismarques has quit [Ping timeout: 256 seconds]

23:51 luismarques has joined #yosys

23:55 tpb has quit [Remote host closed the connection]

23:55 tpb has joined #yosys