#m-labs on 2016-12-07 — irc logs at freenode.irclog.whitequark.org

2015-03-04 14:45 sb0 changed the topic of #m-labs to: ARTIQ, Migen, MiSoC, Mixxeo & other M-Labs projects :: fka #milkymist :: Logs http://irclog.whitequark.org/m-labs

03:05 bentley` has quit [Ping timeout: 248 seconds]

04:12 rohitksingh_work has joined #m-labs

05:15 whitequark has quit [Ping timeout: 260 seconds]

05:16 cyrozap has quit [Ping timeout: 260 seconds]

05:17 whitequark has joined #m-labs

05:20 cyrozap has joined #m-labs

05:33 <sb0> whitequark, hadn't you said earlier that llvm was good at optimizing array bound checking? (e.g. hoisting it out of loops)

06:30 kuldeep has quit [Ping timeout: 244 seconds]

06:33 kuldeep has joined #m-labs

09:35 <rjo> whitequark: cython has directives that disable those checks. http://cython.readthedocs.io/en/latest/src/reference/compilation.html#compiler-directives

10:10 rohitksingh_wor1 has joined #m-labs

10:13 rohitksingh_work has quit [Ping timeout: 260 seconds]

10:33 bentley` has joined #m-labs

12:18 sandeepkr has joined #m-labs

12:28 <whitequark> sb0: sure. but it's not magic and it cannot generally optimize checks away, just merge existing ones in the absence of aliasing

12:29 <whitequark> when I add aliasing information for fields that might improve things at very low cost

12:30 <whitequark> rjo: perhaps. but i would be cautious. we do not have any tools that can detect out-of-bounds accesses and the way we perform allocation means that out-of-bounds accesses will just result in bogus data returned most of the time

12:30 <whitequark> (the stack frames generally contain large contiguous chunks of data with few interspersed pointers)

12:31 <whitequark> *maybe* we could enable stack smashing protection as a first line of defense against this. it's extremely cheap, much cheaper than the checks...

12:31 <whitequark> rjo: what I would much prefer is an extension to mor1kx that moves bounds checking to hardware.

12:32 <rjo> whitequark: a.k.a. MMU?

12:33 <whitequark> rjo: nope, an MMU would not help us at all.

12:33 <whitequark> unless we're adding a heap allocator and everything.

12:34 <whitequark> well, I guess we could allocate on stack in 4k granularity but that will have worst cases with small arrays

12:36 <whitequark> rjo: let me think of some unobtrusive way to implement it

12:36 <rjo> whitequark: but in our case out-of-bounds is not worse than on a regular OS. just that the allocator is different.

12:36 <whitequark> rjo: not quite.

12:36 <whitequark> on a regular OS you have valgrind and ubsan

12:37 <whitequark> ubsan especially is taking advantage of "shadow pages" to drive cost of the checks quite low

12:37 <rjo> whitequark: in our case one would toggle the "fast-but-dangerous" flag and get exceptions.

12:38 <whitequark> so what I expect to happen is that people will get used to having the "fast-but-dangerous" flag on all the time.

12:38 <whitequark> then get bogus data.

12:38 <whitequark> on an OS you will get crashes pretty quickly because you have weird pointers

12:38 <whitequark> Python has pointers stuffed everywhere throughout its heap, and overwriting it in a way that silently does the wrong thing is hard

12:39 <whitequark> we will also crash on invalid pointers about 3/4 of time because of alignment errors, even without an MMU

12:39 <whitequark> since we own the CPU why cannot we drive the cost of checks down instead?

12:39 <whitequark> e.g. a dedicated "bounds check" instruction.

12:42 <rjo> sure. but you still have to carry around the bounds data everywhere.

12:47 <whitequark> but we already do.

12:47 <whitequark> the slices (and strings, soon) are struct { len, ptr } that are passed by value.

12:48 <whitequark> this for one allows slicing that has essentially zero cost because it's just two arithmetic operations
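
A minimal sketch of the { len, ptr } slice representation described above, written in Python for illustration only. The names Slice, subslice and load are hypothetical, and a list index stands in for a real pointer; this is not the ARTIQ compiler's actual code.

```python
from typing import NamedTuple


class Slice(NamedTuple):
    """A by-value slice descriptor: element count plus start position."""
    length: int  # number of elements visible through this slice
    ptr: int     # index of the first element (stands in for a pointer)


def subslice(s, start, stop):
    """Take s[start:stop]; apart from the check, it is two arithmetic operations."""
    if not (0 <= start <= stop <= s.length):
        raise IndexError("slice bounds out of range")
    return Slice(length=stop - start, ptr=s.ptr + start)


def load(memory, s, i):
    """Bounds-checked element read using the length carried in the slice."""
    if not (0 <= i < s.length):
        raise IndexError("index out of range")
    return memory[s.ptr + i]
```

Because the length travels with the pointer, the bounds data needed for a check is already at hand wherever the slice is used.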

12:58 ohama has quit [Read error: Connection reset by peer]

13:00 ohama has joined #m-labs

13:00 rohitksingh_wor1 has quit [Read error: Connection reset by peer]

13:33 rohitksingh has joined #m-labs

15:03 <GitHub118> [artiq] sbourdeauducq pushed 1 new commit to master: https://git.io/v1un2

15:03 <GitHub118> artiq/master 4c37179 Sebastien Bourdeauducq: drtio: link layer debugging CSRs

15:10 <sb0> rjo, we need to put some of the hardware initialization into the runtime, because we need the clock chips to work before we can use the DRTIO transceivers

15:11 <sb0> in this case, can't it just initialize the JESD links at the same time?

15:11 <sb0> also, the DRTIO protocol implementation won't extrapolate well to SPI

15:12 fengling has quit [Ping timeout: 268 seconds]

15:21 fengling has joined #m-labs

15:26 <bb-m-labs> build #253 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/253

15:40 <GitHub175> [artiq] sbourdeauducq commented on issue #636: Merging the address into the channel sounds OK. https://git.io/v1uBj

15:49 <bb-m-labs> build #1153 of artiq is complete: Failure [failed python_unittest_1] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1153 blamelist: Sebastien Bourdeauducq <sb@m-labs.hk>

15:50 <GitHub69> [artiq] sbourdeauducq commented on issue #636: The DMA playback engine takes LSB-first data of arbitrary length with byte granularity and zeros the missing MSBs, and DRTIO similarly removes zeros in front of data.... https://git.io/v1u0c

15:59 <rjo> sb0: why does DRTIO not work for SPI?

16:00 <sb0> well some parts can be recycled of course, but it's not completely straightforward

16:00 <rjo> sb0: ack the autonomous clock tree and JESD bootstrapping.

16:01 <rjo> sb0: but still: in the end the SPI interface to the DAC (also) needs to be exposed to the user.

16:02 <sb0> for example, DRTIO needs a framing signal. in SPI we can use CS, unless the chipmaker designed the SPI core as you did where you don't precisely control CS

16:02 <rjo> sb0: do you mean the non-RTIO SPI in phaser? or the generic RTIO SPI master?

16:03 <rjo> in the RTIO SPI PHY, CS is precisely controlled.

16:03 <sb0> but it's not if you connect that core to a CPU, and in your defense, you said that another chipmaker (motorola?) also did that

16:04 <rjo> it is not always controlled precisely if you do chained transfers. otherwise it is.

16:04 <sb0> but for framing a DRTIO frame you'd need chained transfers

16:04 <sb0> or the protocol needs to be changed in some way

16:05 <rjo> and i'd be happy to accept a patch that only releases CS if all bits of a chained transfer are transferred.

16:06 <sb0> yes, but that doesn't help if someone is using another SPI master with an imprecisely controlled CS

16:06 <rjo> sb0: i don't understand what you are saying. if you do a chained SPI transfer over DRTIO you just have to set the timestamp so that it will be chained.

16:07 <sb0> Jonathan wants to use Sayma with an undefined SPI master, not MiSoC/ARTIQ stuff

16:07 <rjo> yeah. but that's an SPI slave then.

16:07 <rjo> i just wrote one for PDQ3

16:07 <sb0> if that SPI master can't control CS precisely then we can't use it as DRTIO framing signal

16:08 <sb0> I'm just using the MiSoC SPI core as an example design that cannot always control CS precisely

16:09 <rjo> are you talking about jonathan's idea of RTIO-over-SPI-over-cpu-over-RTIO? or the flat abstraction of Sayma into a SPI "peripheral"

16:10 <sb0> Sayma into a SPI peripheral

16:11 <sb0> there is no problem putting the SPI PHY into a DRTIO channel

16:11 <rjo> ok. if CS comes "late" after the last relevant bit, why is that a problem for DRTIO framing?

16:12 <rjo> ack. let's call that thing (which he doesn't want (yet)) "DRTIO-over-SPI".

16:12 <sb0> insert just one bit due to a clock glitch (e.g. at power up) and all SPI comms break down as they lose sync

16:12 <rjo> but that would be no CS control at all.

16:12 <sb0> again that can be solved, but it's not just "plug SPI into the other end of the DRTIO receiver"

16:14 <rjo> in my mind, the SPI slave would shift in a variable amount of data, and when CS is deasserted, push that framed packet into the same pipeline the DRTIO packets would go into.

16:14 <rjo> when CS is asserted, start a new packet.

16:14 <rjo> yes. this breaks if there is no proper CS.

16:15 <sb0> also SPI will come with its own clock that has no relation to the RTIO clock and isn't even free-running, unlike DRTIO which is fully synchronous

16:16 <rjo> SPI controllers either have precise control over CS or then just don't do clock cycles before and after the actual data.

16:16 <rjo> yes. just like for PDQ.

16:17 <sb0> btw how are you sampling the SPI clock?

16:17 <rjo> precise CS is not needed as long as there are no extra clock cycles.

16:17 <rjo> with hysteresis.

16:17 <sb0> what if there are extra clock cycles due to power-up glitches?

16:17 <sb0> hysteresis?

16:17 <rjo> aka debouncing.

16:18 <rjo> https://github.com/m-labs/pdq2/blob/master/gateware/spi.py

16:18 <sb0> you mean there is a schmitt trigger on the pcb?

16:18 <sb0> ah, ok good

16:18 <rjo> no. multiregs and then a hysteretic debouncer.
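
A rough behavioral sketch of a hysteretic debouncer, in Python rather than Migen. It assumes a saturating up/down counter whose output only flips at the two extremes; the class name and the depth parameter are hypothetical, and the actual logic in pdq2's spi.py may differ.

```python
class HystereticDebouncer:
    """Output changes only after the input has dominated for `depth` net samples."""

    def __init__(self, depth=3):
        self.depth = depth
        self.count = 0  # saturating counter in the range [0, depth]
        self.out = 0    # debounced output

    def sample(self, raw):
        # Count up on a high sample, down on a low sample, saturating at both ends.
        if raw:
            self.count = min(self.count + 1, self.depth)
        else:
            self.count = max(self.count - 1, 0)
        # Hysteresis: switch only when the counter reaches an extreme.
        if self.count == self.depth:
            self.out = 1
        elif self.count == 0:
            self.out = 0
        return self.out
```

In this model a single stray sample in either direction never changes the output, which is the property that matters for an asynchronously sampled SPI clock.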

16:19 <rjo> if there are extra cycles due to glitches then (a) if CS is deasserted they don't matter and (b) if CS happens to be asserted as well due to a glitch then they constitute an incomplete packet and when CS is deasserted, the short packet is just canceled.

16:20 <sb0> so all transfers below, say, 32 bits are discarded?

16:21 <rjo> below minimum drtio packet length. i'd guess that's timestamp + channel number + a few data bits.

16:21 <sb0> i.e. the glitches would need to consist of >=32 clock pulses plus CS asserted at all times to cause trouble

16:21 <rjo> yes.
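
The CS-framed receive scheme discussed above can be sketched as a small behavioral model. This is an illustration under assumptions, not the PDQ or DRTIO gateware: the class name, the method names and the 32-bit default minimum (taken from sb0's "say, 32 bits") are all hypothetical.

```python
class CsFramedReceiver:
    """Collects bits while CS is asserted; a packet is emitted when CS is released."""

    def __init__(self, min_bits=32):
        self.min_bits = min_bits  # packets shorter than this are cancelled
        self.cs = False
        self.bits = []
        self.packets = []

    def set_cs(self, asserted):
        if asserted and not self.cs:
            # CS asserted: start a new packet.
            self.bits = []
        elif not asserted and self.cs:
            # CS deasserted: push the framed packet, or cancel it if too short.
            if len(self.bits) >= self.min_bits:
                self.packets.append(tuple(self.bits))
            self.bits = []
        self.cs = asserted

    def clock(self, bit):
        # Clock edges while CS is deasserted are ignored.
        if self.cs:
            self.bits.append(bit)
```

In this model a glitch only produces a spurious packet if it supplies at least min_bits clock pulses while CS is asserted, and every CS deassertion resynchronizes the framing.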

16:21 <sb0> it's still kinda fragile, a software bug on the other end can easily produce that, and you can't reset

16:21 <sb0> maybe add a gpio reset line?

16:22 <rjo> or we could even do magic interface-enable sequences.

16:22 <rjo> but it doesn't break much. it only inserts an event into a RTIO FIFO.

16:22 <rjo> plus a reset command.

16:22 <sb0> if CS is not used as framing signal, it can desync the whole SPI comms

16:22 <rjo> the framing is self-healing.

16:23 <rjo> yes.

16:23 <whitequark> rjo: ack re: RTIO over SPI

16:23 <rjo> but i'd expect CS.

16:23 <sb0> so that's not just a spurious rtio event, that's losing control of the device

16:23 <rjo> without CS you'd lose control. yes.

16:24 <rjo> but doing SPI without CS is masochistic.

16:25 <whitequark> rjo: I'm curious. you're mentioning a "spline knot". so is the phaser branch using ADCs to output waveforms defined by splines? which type?

16:25 <rjo> yes. b-splines.

16:26 <whitequark> that's remarkably flexible

16:26 <whitequark> I should figure out how it works

16:26 <whitequark> are the internals documented anywhere?

16:30 <rjo> whitequark: http://pdq2.readthedocs.io/en/latest/architecture.html#spline-interpolation and spline.py in gateware and coredevice

16:30 <whitequark> larsc: https://pbs.twimg.com/media/CzFdspIXAAA2lmp.jpg:large is this... a microwave breadboard?

16:30 <whitequark> rjo: thanks

16:31 <rjo> yep. that's a microwaye breadboard.

16:32 <rjo> wave

16:33 <rjo> but afaict in practice they tend to just buy optical breadboards and bolt stuff down on those, still using coax cables to connect boxes.

16:35 <GitHub194> [artiq] jordens commented on issue #636: The automatic zero-stripping/extending sounds good.... https://git.io/v1uV6

16:37 zoobab has quit [Ping timeout: 245 seconds]

17:46 stekern has quit [Ping timeout: 260 seconds]

17:46 stekern has joined #m-labs

18:46 rohitksingh has quit [Quit: Leaving.]

19:22 <whitequark> rjo: ooh, I haven't realized you can compute b-splines using just addition.
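
One generic way to evaluate a polynomial spline segment with additions only is the forward-difference method, sketched below in Python. This is the textbook technique and an assumption about what is meant here, not a copy of spline.py; the function name is hypothetical.

```python
def forward_difference_eval(diffs, n):
    """Evaluate a polynomial at n uniform steps using only additions.

    `diffs` holds the initial value followed by its forward differences,
    e.g. [f(0), Δf(0), Δ²f(0), Δ³f(0)] for a cubic.
    """
    state = list(diffs)
    out = []
    for _ in range(n):
        out.append(state[0])
        # Each accumulator adds the next-higher difference (old value,
        # since we update in ascending order).
        for i in range(len(state) - 1):
            state[i] += state[i + 1]
    return out
```

For example, f(t) = t³ has f(0) = 0 and forward differences 1, 6, 6 at t = 0, so `forward_difference_eval([0, 1, 6, 6], 5)` yields 0, 1, 8, 27, 64 with no multiplications in the loop.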

19:23 <whitequark> hm, there's also cordic, i never understood how that works. but i guess there is already documentation for it

19:24 <whitequark> the gateware is really not complex

19:56 mumptai has joined #m-labs

20:40 <GitHub168> [artiq] r-srinivas commented on issue #407: @whitequark @sbourdeauducq Would this be related to #637 or an independent problem? https://git.io/v1zln

21:06 sandeepkr has quit [Ping timeout: 265 seconds]

23:53 mumptai has quit [Remote host closed the connection]