#m-labs on 2018-08-29 — irc logs at freenode.irclog.whitequark.org

2018-08-01 14:34 sb0 changed the topic of #m-labs to: https://m-labs.hk :: Logs http://irclog.whitequark.org/m-labs :: Due to spam bots, only registered users can talk. See: https://freenode.net/kb/answer/registration

00:06 futarisIRCcloud has joined #m-labs

00:44 Gurty has quit [Excess Flood]

00:45 Gurty has joined #m-labs

00:45 Gurty has quit [Changing host]

01:00 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #1139: > What complexity and bugs are you referring to?... https://github.com/m-labs/artiq/issues/1139#issuecomment-416788332

01:19 bb-m-labs has quit [*.net *.split]

01:19 bb-m-labs has joined #m-labs

01:37 futarisIRCcloud has quit [Read error: Connection reset by peer]

01:37 futarisIRCcloud has joined #m-labs

02:42 balrog has quit [*.net *.split]

02:42 adamgreig has quit [*.net *.split]

02:42 Astro- has quit [*.net *.split]

02:42 Astro- has joined #m-labs

02:43 adamgreig has joined #m-labs

02:48 adamgreig is now known as Guest33733

02:58 balrog has joined #m-labs

03:18 _whitelogger has joined #m-labs

03:47 rohitksingh_work has joined #m-labs

03:54 mumptai_ has joined #m-labs

03:58 mumptai has quit [Ping timeout: 268 seconds]

05:56 <GitHub-m-labs> [artiq] klickverbot commented on issue #1115: Will have a look at this hopefully later today, currently traveling. https://github.com/m-labs/artiq/pull/1115#issuecomment-416833410

06:57 <7GHAAAJWP> [artiq] jordens closed issue #1139: remove device_db alias support https://github.com/m-labs/artiq/issues/1139

06:57 <5EXAAAD0Y> [artiq] jordens commented on issue #1139: ACK. Let's file a new issue for those. https://github.com/m-labs/artiq/issues/1139#issuecomment-416845857

06:59 <GitHub-m-labs> [artiq] jordens opened issue #1140: device_db alias corner case bugs https://github.com/m-labs/artiq/issues/1140

07:37 Guest33733 has quit [Quit: WeeChat 1.8]

07:37 adamgreig has joined #m-labs

08:37 <GitHub-m-labs> [artiq] hartytp commented on issue #801: > I'd like to hear from other users before cutting AM/PM, but it's fine from my vantage point especially if it makes resource counts go from 'more than we have' to 'workable'.... https://github.com/m-labs/artiq/issues/801#issuecomment-416872415

08:44 <GitHub-m-labs> [artiq] hartytp commented on issue #801: > @hartytp This is actually to feed-forward frequency noise from the laser we use for our 2-qubit gates. We're correcting for acoustical noise in the laser cavity (dominant noise peaks are a few hundred Hz), not slow drifts (like the ones you mention) that we intend to address by recomputing.... https://github.com/m-labs/artiq/issues/801#issuecomment-416874751

08:51 sb0 has quit [Ping timeout: 244 seconds]

08:52 sb0 has joined #m-labs

09:01 attie has quit [Remote host closed the connection]

09:01 attie has joined #m-labs

09:17 futarisIRCcloud has quit [Quit: Connection closed for inactivity]

09:22 sb0 has quit [Ping timeout: 240 seconds]

09:25 hartytp has joined #m-labs

09:25 <hartytp> larsc: thanks

09:25 <hartytp> I'd seen that about the JESD204B DACs

09:26 <hartytp> but, we have a few use-cases that need a DAC that's DC precise/low-noise, but that has a sample rate of 10MHz or so, so a few times higher than SPI can provide

09:26 <hartytp> in that case, JESD204B is overkill (and, in any case, most of those DACs aren't great at DC)

09:26 <hartytp> so, it's really nice to have a parallel 16-bit DAC with decent noise/temp co etc, but it seems that ADI is moving away from providing those

09:40 sb0 has joined #m-labs

10:08 <GitHub-m-labs> [artiq] cjbe opened issue #1141: Urukul channels sporadically lock up until reset https://github.com/m-labs/artiq/issues/1141

10:15 <GitHub-m-labs> [artiq] jordens commented on issue #1141: This is AD9910-specific. I have seen it as well. And we know that its bigger brother the AD9914 is equally easy to bring into a locked up state. I'd hypothesize that a multi-transfer SPI transcation is interrupted mid-way (due to an underflow or due to bad initialization or due to some other reason) and the AD9910 machinery is driven into a non-recoverable state by subs

10:28 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #1141: > In this experiment we have a Kasli master and DRTIO satellite, with one Urukul on the master, and 2 on the satellite. We have seen channels on both the master and satellite lock up.... https://github.com/m-labs/artiq/issues/1141#issuecomment-416904872

10:32 <GitHub-m-labs> [artiq] cjbe commented on issue #1141: @sbourdeauducq yes - we have two Kasli master-satellite DRTIO setups running at 125 MHz. They have both been running for several months without any problems. https://github.com/m-labs/artiq/issues/1141#issuecomment-416905765

10:36 <GitHub-m-labs> [artiq] gkasprow commented on issue #1141: Maybe we should add simple power management over I2C? Just IO extender + PMOS. On by default. https://github.com/m-labs/artiq/issues/1141#issuecomment-416906781

10:39 <GitHub-m-labs> [artiq] hartytp commented on issue #1141: @gkasprow better to make sure that the code doesn't put the AD9910 into this state to begin with (I've used AD9910s continuously updating for months without any issues, so I know that they can be programmed reliably). https://github.com/m-labs/artiq/issues/1141#issuecomment-416907482

10:42 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #1141: @hartytp What is the difference in the programming sequence? https://github.com/m-labs/artiq/issues/1141#issuecomment-416908281

10:54 <GitHub-m-labs> [artiq] cjbe commented on issue #1141: @sbourdeauducq the legacy systems @hartytp is referring to just use an FSM that cannot be interrupted to generate the SPI transactions. I think the point is that it is the interruption of the SPI transaction in the middle of a multiword transfer that causes the DDS to misbehave, rather than an inherent bug in the DDS itself if one completes the transactions properly.

10:57 <GitHub-m-labs> [artiq] hartytp commented on issue #1141: exactly https://github.com/m-labs/artiq/issues/1141#issuecomment-416911906

10:59 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #1141: There is an inherent bug in the DDS chip if it doesn't recover from interrupted SPI transactions after a master reset. Resets ought to be able to clear any chip state that can be programmed from a digital interface. https://github.com/m-labs/artiq/issues/1141#issuecomment-416912406

11:05 <GitHub-m-labs> [artiq] sbourdeauducq commented on issue #1141: And, while it is not strictly a bug, it is poor design that interrupted transfers put the chip into a broken state. Try breaking the Si5324 by comparison. But, at least the master reset on the AD9910 puts the chip into a working state, contrary to the AD9914 where everything is borked after a reset. Sigh...... https://github.com/m-labs/artiq/issues/1141#issuec

11:07 <GitHub-m-labs> [artiq] hartytp commented on issue #1141: > The only advantage I see to the software workaround is supporting the existing Urukul fleet.... https://github.com/m-labs/artiq/issues/1141#issuecomment-416914410

11:08 <GitHub-m-labs> [artiq] hartytp commented on issue #1141: > The only advantage I see to the software workaround is supporting the existing Urukul fleet.... https://github.com/m-labs/artiq/issues/1141#issuecomment-416914410

12:16 sb0 has quit [Ping timeout: 252 seconds]

12:25 <GitHub-m-labs> [artiq] jordens commented on issue #675: As this is exclusively monitoring and injection for the sum, there are 8 injection and 8 monitoring channels. that could still be doable with the current design. I should have numbers on the resource/routing impact soon. https://github.com/m-labs/artiq/issues/675#issuecomment-416933434

12:28 sb0 has joined #m-labs

12:33 rohitksingh_work has quit [Read error: Connection reset by peer]

12:49 <GitHub-m-labs> [artiq] jordens commented on issue #801: @hartytp ... https://github.com/m-labs/artiq/issues/801#issuecomment-416940356

13:03 key2_ has joined #m-labs

13:05 key2 has quit [Ping timeout: 252 seconds]

13:13 hartytp has quit [Quit: Page closed]

13:13 rohitksingh has joined #m-labs

13:28 mumptai_ has quit [Quit: Verlassend]

13:45 hjr3 has quit [Quit: ZNC - 1.6.0 - http://znc.in]

14:36 X-Scale has quit [Ping timeout: 244 seconds]

14:41 rohitksingh has quit [Quit: Leaving.]

15:03 rohitksingh has joined #m-labs

15:13 rohitksingh has quit [Quit: Leaving.]

15:39 rohitksingh has joined #m-labs

15:40 X-Scale has joined #m-labs

15:45 rohitksingh has quit [Quit: Leaving.]

15:48 rohitksingh has joined #m-labs

15:55 rohitksingh has quit [Quit: Leaving.]

15:57 rohitksingh has joined #m-labs

16:08 rohitksingh has quit [Quit: Leaving.]

16:30 <rjo> sb0: how is SIPHASER_SKEW=32 determined?

16:32 <sb0> rjo: manually, look where errors happen and put it in the middle of the working zone

16:33 <rjo> errors == siphaser alignment failures?

16:33 <sb0> no, drtio data corruption

16:34 <sb0> this controls the skew between the noisy transceiver recovered clock and the si5324 output

16:34 <sb0> the transceiver outputs data in its recovered clock domain, and it is re-registered into the si5324 output domain by RXSynchronizer

16:35 <sb0> RXSynchronizer is a bunch of FFs placed next to each other, and has large data eyes. most SIPHASER_SKEW values work.

16:35 <rjo> sb0: without a CDC?

16:36 <sb0> CDCs don't have deterministic latency

16:36 <rjo> or RXSynchronizer is a "CDC that assumes reasonably well aligned phase"

16:37 <sb0> it's a data recapture in a phase-aligned domain. if you set it to None it gets replaced with an elastic buffer (useful for debugging if you suspect problems there)

16:37 <sb0> an elastic buffer gives you latency non-determinism instead of data corruption when the phases aren't right

16:38 <rjo> yeah. i get that.

16:39 <rjo> coarse (1/rtio_freq) non-determinism.

16:39 <sb0> if we start getting a lot of problems with this thing, we should write an autocal routine for it. send PRNG data through RXSynchronizer and scan the phases

16:39 <sb0> btw, there are several WR implementations that have the transceiver elastic buffer enabled, and just assume that the clock phase is right for deterministic latency ...

16:41 <rjo> hmm. you LOCed them. for async reset synchronizers which are the same structure (FFs close together) the recommended way is with ASYNC_FF and a timing constraint. just FYI. I am not keen on rewriting it.

16:41 <rjo> or RLOC. well I guess that's fine.

16:42 <rjo> and SIPHASER_PHASE is stable enough across rebuilds/logic added/removed?

16:43 <sb0> on 7-series yes, but I'm not really sure on ultrascale

16:45 <rjo> (SKEW, not PHASE) and you'd expect SIPHASER_SKEW to be a constant **delay** and not necessarily a constant number of PSINCDECs, right?

16:45 <sb0> it is definitely not a constant number of PSINCDEC, since the latter depends on what skew the Si5324 has after locking

16:46 <sb0> it is the target skew between the CDR output and the Si5324 output

16:47 <rjo> i mean a constant offset of PSINCDECs from the 0->1 transition.

16:48 <sb0> ah, yes

16:49 <rjo> and finally: any reason to choose 1200 MHz for that second MMCM VCO and not something closer to the max int(1440 MHz/rtio_clk_freq)?

16:49 <rjo> ... i meant the divider closer to the max, which is int(...)

16:51 <rjo> ... in order to get max phase shift resolution.

16:51 <sb0> no. afaict I just read the datasheet wrong and thought the maximum frequency was 1200, while it is 1440 for that speed grade

16:54 <rjo> ok. SIPHASER_SKEW both 32 for kasli and sayma is that coincidence?

16:55 <rjo> sb0: ah. i guess 1200 MHz because it is the max for our Sayma grade.

16:57 <sb0> sayma is less tested, I just modified that value around a bit and the margins looked fine

16:58 <sb0> I guess the best/rigorous way to do this is the autocal with PRNG. also helps with pinning down any P&R/P/V/T variations

16:59 <sb0> it could scan around the programmed value and check the margins like the jesd204 sysref code

17:03 <sb0> could be PRNG or just flip all bits on every cycle

17:05 <rjo> can't we use the data strem from the master, i.e. assume mostly K

17:08 <GitHub-m-labs> [migen] jordens pushed 1 new commit to master: https://github.com/m-labs/migen/commit/97e26516292749cb5deac62e68f3911818fb9eb0

17:08 <GitHub-m-labs> migen/master 97e2651 Robert Jördens: kasli: set USERID and USR_ACCESS...

17:08 <sb0> the rx synchronizer is after 8b10b decoding

17:09 <sb0> moving it before needs changing the code + breaks compatibility with potential transceivers with built-in 8b10b decoders that cannot be disabled

17:09 <sb0> the K sequence is a much inferior test pattern than 0xffff / 0x0000

17:10 <sb0> the rest of the stack needs to be disabled anyway while doing the calibration to avoid processing garbage data, so a gateware change is needed anyway

17:11 <sb0> finally, assuming the K sequence from the transceiver increases fragility

17:12 <sb0> (what if the master wasn't sending Ks, what if the receiver isn't working properly?)

17:13 <rjo> hmm. why is it not sufficient to just use the noisy GT clock again and align to that? what am i missing.

17:13 <rjo> wouldn't you be doing the same thing with 0xffff/0x0000?

17:15 <rjo> btw. i noticed that all my cases where i had "nothin on the UART" with kasli bitstreams were simply due to v1.0/v1.1 mismatch. weren't hartytp and marmelada also reporting those?

18:12 <GitHub-m-labs> [artiq] jordens pushed 6 new commits to master: https://github.com/m-labs/artiq/compare/a5cd7d27612f...e7dba344759c

18:12 <GitHub-m-labs> artiq/master ccc58a0 Robert Jördens: satman: add 125 MHz Si5324 settings...

18:12 <GitHub-m-labs> artiq/master eb9e963 Robert Jördens: siphaser: support 125 MHz rtio clk...

18:12 <GitHub-m-labs> artiq/master 9584c30 Robert Jördens: kasli: DRTIO Base: flexible rtio_clk_freq

18:57 hartytp has joined #m-labs

18:57 <hartytp> rjo: I never reported that for Kasli, only Sayma

18:57 <hartytp> Kasli always worked fine for me

19:00 <GitHub163> [smoltcp] pothos commented on issue #106: This can be closed, or? :) https://github.com/m-labs/smoltcp/issues/106#issuecomment-417067517

19:08 hartytp has quit [Quit: Page closed]

19:12 <GitHub29> [smoltcp] pothos opened issue #260: Poll panic on corrupted input https://github.com/m-labs/smoltcp/issues/260

19:21 <GitHub-m-labs> [artiq] hartytp closed pull request #1105: [NFC] Sayma: hmc7043 add explanation of input buffer configuration, and do … (master...master) https://github.com/m-labs/artiq/pull/1105

19:32 kuldeep has quit [Ping timeout: 264 seconds]

19:34 kuldeep has joined #m-labs

19:38 <rjo> hartytp: ok. i misremembered then.

20:22 mumptai has joined #m-labs

21:04 little-dude has joined #m-labs

21:13 <GitHub74> [smoltcp] dlrobertson commented on issue #260: Do you have a packet capture of the packet that caused this? https://github.com/m-labs/smoltcp/issues/260#issuecomment-417107804

21:40 <GitHub-m-labs> [artiq] drewrisinger commented on issue #1138: Confirmed. Misspelled `m-labs` as `mlabs`. Fixed the issue. https://github.com/m-labs/artiq/issues/1138#issuecomment-417115146

22:22 <GitHub34> [smoltcp] pothos commented on issue #260: Unfortunately not, but I'll try to reproduce it with a somehow patched version that dumps the packet (sounds possible) or maybe in gdb… https://github.com/m-labs/smoltcp/issues/260#issuecomment-417126163

22:41 <GitHub65> [smoltcp] pothos opened issue #261: Debug panic in packet assembler https://github.com/m-labs/smoltcp/issues/261

22:45 <GitHub85> [smoltcp] jhwgh1968 commented on issue #106: I think so, but since @whitequark opened it, I don't have the power to close it. https://github.com/m-labs/smoltcp/issues/106#issuecomment-417131304

22:52 <GitHub52> [smoltcp] whitequark closed issue #106: Implement TCP window scaling https://github.com/m-labs/smoltcp/issues/106

23:27 benreynwar has joined #m-labs