#m-labs on 2018-03-23 — irc logs at freenode.irclog.whitequark.org

2015-03-04 14:45 sb0 changed the topic of #m-labs to: ARTIQ, Migen, MiSoC, Mixxeo & other M-Labs projects :: fka #milkymist :: Logs http://irclog.whitequark.org/m-labs

02:07 balrog has quit [Ping timeout: 264 seconds]

02:17 balrog has joined #m-labs

02:33 balrog has quit [Ping timeout: 240 seconds]

02:35 balrog has joined #m-labs

02:45 rohitksingh_work has joined #m-labs

03:16 <GitHub-m-labs> [artiq] jbqubit commented on issue #908: Built from master with SAWG. Loaded onto board that I've had for many months -- not the one that @marmeladapk mentioned is in the mail. ... https://github.com/m-labs/artiq/issues/908#issuecomment-375531041

03:21 <GitHub-m-labs> [artiq] jbqubit commented on issue #908: In subsequent load.... success. ... https://github.com/m-labs/artiq/issues/908#issuecomment-375531969

04:07 sb0 has quit [Quit: Leaving]

04:16 early has quit [Quit: Leaving]

04:19 early has joined #m-labs

04:35 <davidc__> rjo: I don't have a ton of experience, I've mostly just used some Point Grey GIGE vision cameras with their own SDK

04:36 <davidc__> rjo: but I had to debug some stuff, so spent a bit of time fiddling with the SDK

04:36 <davidc__> * and looking at it in wireshark

06:12 sb0 has joined #m-labs

06:58 <sb0> whitequark, i think the errors you're seeing with openocd are just jtag chain confusion with/without rtm.

06:58 <sb0> /usr/local/share/openocd/scripts/board/sayma_amc.cfg doesn't work when the rtm *is* connected

06:58 <sb0> artiq_flash doesn't work when the rtm *is not* connected

07:01 <sb0> whitequark, additionally sayma-3 (florent's board) lacks a hardware rework to make the xadc work I think

07:03 <whitequark> hm okay

07:07 <sb0> whitequark, also you are the first one to try the DACs on sayma-1. like most things on sayma, whether it works or not depends heavily on the particular board (and the phase of the moon). what has been tested and confirmed to work (sometimes) is rtm-1 on sayma-3 (florent's)

07:07 <sb0> though we fixed a number of things since that test; with luck it may work

07:10 <whitequark> alright

07:10 <whitequark> is there an overview of the architecture somewhere?

07:11 <whitequark> or is it just undocumented migen code?

07:11 <sb0> architecture of?

07:11 <whitequark> DACs

07:11 <whitequark> there's this thing called JESD and everything

07:11 <whitequark> I'm not sure how it fits together

07:11 <whitequark> well I guess overall architecture of sayma

07:11 <sb0> there are JESD204 (google it) links between the AMC FPGA and the ADI DAC chips

07:12 <sb0> those DACs also have a SPI interface, which is connected to the RTM FPGA, and is accessed by the AMC FPGA over the serwb bridge

07:15 <whitequark> remind me, why do we have the AMC/RTM split in the first place?

07:15 <whitequark> are there multiple RTM boards planned?

07:17 <sb0> this is somewhat controversial and debatable, but 1) use of the desy clock backplane (which is now pretty much dead) 2) not enough space on one board 3) EMI and thermal considerations

07:19 <whitequark> yes, I see why it's controversial

07:28 <whitequark> ok, I think I understand what JESD204 is now

07:28 <whitequark> 12.5 Gbps, wow

07:29 <sb0> yeah *8

07:29 <whitequark> that explains why the FPGA costs a small fortune

07:30 <sb0> not really; USB3 chips don't cost a small fortune but have similar technology inside

07:30 <sb0> it's mostly xilinx market segmentation

07:30 <sb0> also those transceivers would certainly be cheaper if they weren't designed in such a stupid way

07:30 <whitequark> which stupid way?

07:31 <sb0> pack them with buggy hardwired features that rather belong in the fabric

07:32 <whitequark> ah

07:32 <sb0> plus many other problems that likely don't impact cost on the xilinx side

07:34 <sb0> at least the xilinx transceiver don't self-destruct when they are not clocked, unlike the altera ones

07:36 <whitequark> the altera ones do what?

07:36 <sb0> https://www.altera.com/support/support-resources/knowledge-base/solutions/rd06092016_720.html

07:37 <whitequark> what in the fuck

07:37 <sb0> iirc you also need some transceiver clock source connected in some way to the fpga, otherwise the software workaround doesn't work

07:38 <sb0> and it will of course not warn you if that clock source is malfunctioning

08:02 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #950: Reproducible on the SYSU target. An idle kernel does not appear to be necessary; I can make the error appear by interrupting one regular kernel with another regular kernel (e.g. running ``artiq_run`` while another instance was already running). https://github.com/m-labs/artiq/issues/950#issuecomment-375573467

08:04 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #950: KC705 is also affected. https://github.com/m-labs/artiq/issues/950#issuecomment-375573705

08:15 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #950: I guess what happens is simple: there are events programmed far into the future by the first kernel (since the LED frequency is low). When the second kernel takes over, the value of ``now_mu`` given by ``break_realtime`` is the value of the RTIO counter plus some delta, which is less than the timestamp of some already programmed events.... https://github.com/m-

08:20 sb0 has quit [Quit: Leaving]

08:26 <GitHub-m-labs> [artiq] whitequark commented on issue #950: Or just maintaining a counter somewhere in the runtime that records the maximum timestamp seen. https://github.com/m-labs/artiq/issues/950#issuecomment-375578164

08:31 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #950: No, the runtime is slow enough already, and this would not work with DMA. https://github.com/m-labs/artiq/issues/950#issuecomment-375579144

09:11 <GitHub-m-labs> [artiq] jordens commented on issue #950: Pinning the timestamp (`now_mu`) to the timestamp CSR would also make it survive across kernel evictions/crashes. https://github.com/m-labs/artiq/issues/950#issuecomment-375588352

09:12 <rjo> whitequark: did you have a chance to look at aravis?

09:27 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #950: The timestamp CSR does not necessarily contain the highest value submitted. https://github.com/m-labs/artiq/issues/950#issuecomment-375592283

09:32 <GitHub-m-labs> [artiq] hartytp commented on issue #951: 2017.4 here, although I've subsequently upgraded to 2017.4.1... https://github.com/m-labs/artiq/issues/951#issuecomment-375593664

09:34 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #950: It might be good enough though (and simple and quite consistent with the current break_realtime behavior within a kernel). https://github.com/m-labs/artiq/issues/950#issuecomment-375594106

09:40 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #951: According to Xilinx the .1 does not make a difference for Ultrascale. https://github.com/m-labs/artiq/issues/951#issuecomment-375595698

09:40 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #951: (the one we have at least) https://github.com/m-labs/artiq/issues/951#issuecomment-375595920

09:46 <GitHub-m-labs> [artiq] enjoy-digital commented on issue #908: @jbqubit: thanks for testing. https://github.com/m-labs/artiq/issues/908#issuecomment-375597539

09:49 <GitHub-m-labs> [artiq] jordens commented on issue #950: Yes. Pinning `now` to the CSR might also be a speed advantage (https://github.com/m-labs/artiq/issues/636). The only issue might be atomicity. But it would be an improvement in any case. https://github.com/m-labs/artiq/issues/950#issuecomment-375598343

09:50 <GitHub-m-labs> [artiq] jordens commented on issue #950: Pinning the timestamp (`now_mu`) to the timestamp CSR would also make it survive kernel evictions/crashes. https://github.com/m-labs/artiq/issues/950#issuecomment-375588352

10:19 hartytp has joined #m-labs

10:19 <hartytp> is there an artiq python way of doing something like

10:19 <hartytp> if self.ldac is not None: self.ldac.on()

10:19 <hartytp> (e.g. make it optional)

10:20 <hartytp> (testing Zotino driver atm

10:26 <rjo> hartytp: that will generally run into the unification issue.

10:26 <rjo> hartytp: maybe just make ldac mandatory.

10:42 <hartytp> okay would prefer to do that

10:42 <hartytp> it wasn't in the ad5360 driver

10:42 <hartytp> but that relied on the user initing ldac high, which seems nasty to me imho -- I'd rather the driver owned that pin completely

10:49 <rjo> hartytp: i don't think it relied on that. any external ldac control would have worked.

10:56 <rjo> hartytp: and it's ldac low.

10:56 <rjo> hartytp: on zotino you can rely on having LDAC.

10:58 <hartytp> rjo: at init LDAC should be driven high, no? Only driven low by load...

11:12 <GitHub-m-labs> [artiq] jordens pushed 1 new commit to master: https://github.com/m-labs/artiq/commit/1553fc8c7df7150c00ce853dea71fc6840606f7e

11:12 <GitHub-m-labs> artiq/master 1553fc8 Robert Jordens: sed: reset `valid` in output sorter

11:17 <rjo> hartytp: i meant that load could be controlled externally (by non rtio-gpio if needed or pulled low permanently). in init and it you have ldac, yes, you should set it high.

11:19 <rjo> sb0: i have the feeling that the interrupt (EventManager) is typically involved in the timing paths for mor1kx in misoc on kasli. can we easily add some pipeline registers there?

11:30 <hartytp> seems like the set function in the ad5360 is incorrect

11:30 <hartytp> busy width varies depending on num channels updated

11:30 <hartytp> cf data sheet fig 9

11:30 <hartytp> fixing

11:40 <hartytp> okay, fixed

11:41 <hartytp> gtg now. will try to finish this eve

11:41 hartytp has quit [Quit: Page closed]

11:53 sb000 has joined #m-labs

11:54 <sb000> hartytp, maybe by redefining/overloading methods?

11:54 <sb000> rjo, does it pass timing when removed?

11:54 <sb000> we don't actually need interrupts at all, I think

11:55 <sb000> it's mostly there for legacy reasons

11:56 <bb-m-labs> build #1384 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/1384

11:56 <sb000> hartytp, you could have a 'ldac setter' subdriver in a separate class, and instantiate a dummy with empty methods when ldac is not present

11:58 <GitHub-m-labs> [artiq] whitequark commented on issue #950: > No, the runtime is slow enough already, and this would not work with DMA.... https://github.com/m-labs/artiq/issues/950#issuecomment-375639522

12:01 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #950: The commit part can be done by the gateware when one given 32-bit part of the 64-bit word is written. LLVM would have to know about this, though. https://github.com/m-labs/artiq/issues/950#issuecomment-375644063

12:03 <whitequark> sb000: we need exceptions

12:03 <whitequark> but no, we don't use interrupts at all; used to use them for UART but that's not the case with Rust code

12:04 sb000 has quit [Ping timeout: 260 seconds]

12:12 <GitHub-m-labs> [artiq] jordens commented on issue #950: Maybe just use the timestamp csr instead of the global and drop the kernel protocol to store/load now. That would be a start. https://github.com/m-labs/artiq/issues/950#issuecomment-375646414

12:25 <bb-m-labs> build #805 of artiq-win64-test is complete: Warnings [warnings python_unittest] Build details are at http://buildbot.m-labs.hk/builders/artiq-win64-test/builds/805 blamelist: Robert Jordens <jordens@gmail.com>

12:28 <bb-m-labs> build #2218 of artiq is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/2218

12:31 rohitksingh_work has quit [Read error: Connection reset by peer]

14:06 sb0 has joined #m-labs

14:15 <GitHub172> [smoltcp] steynh opened issue #181: Can't create IcmpSocketBuffer without heap allocation https://github.com/m-labs/smoltcp/issues/181

14:16 <rjo> sb0: the interrupt seems to clear some of the timing failures but not all.

15:20 key2 has joined #m-labs

15:21 <key2> anyone aware of a project dealing with float numbers made with migen ?

15:24 <sb0> key2, https://github.com/nakengelhardt/fpgagraphlib/blob/master/src/faddsub.py

15:26 <key2> thc

15:26 <key2> thx

15:27 <key2> that is a 2 float adder if I get it right?

15:37 <whitequark> rjo: sb0: this is what I get trying to use openocd with sayma-1:

15:37 <whitequark> https://hastebin.com/ujevegixuq.go

15:37 <whitequark> this is sayma-3:

15:37 <whitequark> https://hastebin.com/ogujorehij.js

15:39 <whitequark> what is its problem?

15:44 <rjo> check the 1.8v supply, check the rtm connection, check your openocd script?

15:46 <sb0> key2, yes, and there are other operations in other files in that repos

15:46 <whitequark> it's not my openocd script, it's the one in artiq_flash

15:47 <sb0> whitequark, artiq_flash is *not* going to work on sayma-3 without modifications (no rtm connected)

15:47 <whitequark> okay, but it doesn't work on sayma-1 either

15:47 <whitequark> why?

15:48 <sb0> 1.8v bug from the looks of it

15:48 <sb0> power cycle

15:51 <key2> sb0: yep i saw that

15:52 <key2> sb0: very interesting. i'll try to make a butterfly fft with that

15:54 <GitHub124> [openocd] whitequark pushed 3 new commits to master: https://github.com/m-labs/openocd/compare/0b26b289fb04...c383a57adcff

15:54 <GitHub124> openocd/master a9b5776 whitequark: Fix warnings (-Wimplicit-fallthrough).

15:54 <GitHub124> openocd/master 1b974ef whitequark: Fix warnings (-Wformat-overflow).

15:54 <GitHub124> openocd/master c383a57 whitequark: Fix incorrect fallthrough.

15:55 <GitHub-m-labs> [artiq] cjbe commented on issue #958: @sbourdeauducq this fixes it. I observe no change in timing alignment between master and satellite serdes outputs over 30 restarts. https://github.com/m-labs/artiq/issues/958#issuecomment-375712247

15:56 <whitequark> sb0: yup, 1V8 bug

15:56 <whitequark> thanks!

15:56 <whitequark> should I solder something somewhere?

15:58 <sb0> one of the suggested workarounds (until there is finally bug-free firmware) was to put larger capacitors on 1.8V, and possibly other rails

15:58 <whitequark> you did that on some board already, right?

15:59 <sb0> I just added a small 4.7µF, that's why the boards don't die within a few minutes

16:00 <sb0> the exar capacitance settings are wrong... it's configured for larger caps

16:00 <sb0> though, we don't know yet if that's the *only* problem

16:00 <whitequark> this goddamn board

16:01 <whitequark> sb0: by the way did you know that the mechanical design of sayma amc's front panel is fucked?

16:01 <whitequark> once you plug something into the fmc cage it's not going back out

16:01 <whitequark> erm, sfp cage

16:01 <sb0> oh, and that's a problem with the front panel?

16:01 <sb0> file an issue

16:02 <whitequark> I tried to unplug the sfp cable and discovered that the front panel presses on the part of the cage that should flex

16:02 <whitequark> issue where? on sinara repo?

16:02 <sb0> for now, yes

16:02 <sb0> that was an issue on one kasli too

16:03 <whitequark> looks like it's already in https://github.com/m-labs/sinara/issues/209

16:13 <whitequark> sb0: now it hangs trying to write to the second flash

16:13 <sb0> does it hang or are you just flashing a large binary that takes time to write?

16:13 <whitequark> it doesn't print anything for a long time

16:14 <sb0> it's not printing anything when it writes

16:14 <sb0> only when erasing

16:14 <whitequark> oh

16:14 <whitequark> that's stupid

16:23 <whitequark> rjo: why did you disable slave FPGA bitstream loading?

16:26 <sb0> whitequark, because it doesn't work for sayma reasons and blocks firmware startup

16:26 <whitequark> sb0: can I get Allaki working without RTM bitstream?

16:27 <whitequark> probably not

16:27 <sb0> whitequark, load it with jtag

16:27 <sb0> see /home/sb/load_rtm

16:28 <sb0> minus the sayma intermittent bug, it recovers from AMC reloads, so you can (often) just leave it there

16:29 <sb0> _florent_, any progress fixing serwb? jesd initialization? jesd sc1?

16:32 <whitequark> got it all working. took me only a hour and a half...

16:32 <whitequark> what a waste of time

16:34 <whitequark> sb0: did you have scope probes somewhere?

16:34 <whitequark> I can't find any

16:34 <whitequark> only empty boxes

16:38 <sb0> whitequark, I don't have many; they are around the tables. note that the 2kV probe you had is burned (I replaced it with a 4kV one) so don't use that one

16:41 <_florent_> sb0: i'll work on that next week

16:48 <rjo> _florent_: re serwb, copying the I/O SERDES/DELAY reset/vtc pattern from the ddr phy may or may not make a difference. also note that (IIRC) the reset i implemented for sayma is different from what you have in sayma_test (and maybe litex as well) currently.

16:49 <_florent_> rjo: yes i have to look at what you did, it's probably better

16:54 <whitequark> sb0: can I reduce the DAC frequency to something like 100 MHz?

16:54 <whitequark> or does it have to run at 1.2 GHz?

16:54 <sb0> why reduce it?

16:55 <whitequark> the scope can't cope with 1.2 GHz...

16:55 <sb0> but you're looking at the dac output, not the dac clock

16:55 <sb0> the triangle wave test pattern is within the scope bandwidth

16:56 <whitequark> I'm looking at the dac clock now

16:56 <sb0> why?

16:56 <whitequark> because there is no output from any of the DAC SMAs

16:56 <whitequark> they are all just permanently high

16:56 <sb0> whats on the log?

16:56 <whitequark> nothing interesting

16:56 <sb0> if the dac clock was wrong there would be an error

16:57 <whitequark> define wrong

16:57 <sb0> did it do the prbs tests?

16:57 <whitequark> prbs tests?

16:57 <whitequark> what's that?

16:57 <sb0> see jesd204

16:57 <whitequark> where in artiq is that?

16:58 <sb0> there should be something in the log about it

16:58 <whitequark> there is nothing in the log about prbs...

16:58 <sb0> then the dacs have not started

16:58 <sb0> what is the full log?

16:58 <whitequark> https://hastebin.com/gafiboduwi.sql

16:59 <sb0> yeah it's crashed

16:59 <sb0> serwb bug maybe

17:00 <sb0> there is quite a lot of difference between your gateware and firmware

17:00 <sb0> could it be that? wrong CSR addresses?

17:00 <whitequark> hm

17:01 <whitequark> shouldn't that result in an exception?..

17:01 <whitequark> but yes, I guess so

17:01 <sb0> I don't see how you can catch all firmware/gateware CSR mismatches

17:04 <whitequark> sb0: https://hastebin.com/murisuhube.sql

17:04 <whitequark> no difference

17:04 <sb0> try the known-good amc/rtm combination I posted earlier

17:05 <sb0> i.e. put that rtm on sayma-3

17:05 <sb0> if that doesn't work then I have no idea, haven't seen this bug, try determining where it hangs

17:06 <whitequark> the RTM with the label 2?

17:07 <whitequark> okay...

17:08 <sb0> the rtm that has the resistor soldered into dac_clk_n

17:08 <whitequark> that's the RTM I'm currently using

17:11 <sb0> yes, but on sayma-1. i have not tested this. try on sayma-3.

17:22 <whitequark> sb0: I need to disable hmc830, right?

17:23 <sb0> whitequark, yes

17:24 <whitequark> [ 5.581018s] WARN(board_artiq::ad9154): AD9154-0 config attempt #18 failed (AD9154 SERDES PLL lock timeout), retrying

17:24 <whitequark> does this mean a problem with DAC clock?

17:26 <rjo> it is consistent with that. yes.

17:35 <whitequark> ok, no idea what's happening here

17:35 <whitequark> the clock is definitely present on the SMA

17:44 mumptai has joined #m-labs

18:52 key2 has quit [Ping timeout: 260 seconds]