sb0 changed the topic of #m-labs to: ARTIQ, Migen, MiSoC, Mixxeo & other M-Labs projects :: fka #milkymist :: Logs http://irclog.whitequark.org/m-labs
Gurty has quit [Ping timeout: 240 seconds]
Gurty has joined #m-labs
<sb0> why is timing failing now? I didn't touch the gateware since the previous build
<sb0> sigh
<sb0> well, it could be that the design is borderline and it says it passes timing when it actually does not
<sb0> I've seen bugs due to the shitty xilinx timing models that went away when cooling the fpga to negative temperatures ...
<sb0> whitequark, do you have a repro for the DMA bug?
<sb0> I just ordered a freezer spray
<sb0> WTF? when building outside of the buildbot, it passes timing
<sb0> oh fuck xilinx garbage
<sb0> bb-m-labs, force build artiq
<bb-m-labs> build forced [ETA 44m49s]
<bb-m-labs> I'll give a shout when the build finishes
<sb0> bb-m-labs, force build --branch=release-2 artiq
<bb-m-labs> The build has been queued, I'll give a shout when it starts
<GitHub> [artiq] sbourdeauducq commented on issue #684: Well, it's not a bug, and we'd like to minimize the amount of code changes on release branches. https://github.com/m-labs/artiq/issues/684#issuecomment-289653729
<bb-m-labs> build #477 of artiq-board is complete: Failure [failed conda_build] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/477
<bb-m-labs> build #1413 of artiq is complete: Failure [failed] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1413
<bb-m-labs> build forced [ETA 44m49s]
<bb-m-labs> I'll give a shout when the build finishes
rohitksingh_work has joined #m-labs
<bb-m-labs> build #478 of artiq-board is complete: Failure [failed conda_build] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/478
<bb-m-labs> build #1414 of artiq is complete: Failure [failed] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1414
sb0 has quit [Quit: Leaving]
mumptai has joined #m-labs
mumptai has quit [Remote host closed the connection]
<GitHub> [artiq] jordens pushed 2 new commits to pdq2: https://github.com/m-labs/artiq/compare/0ad89a15a332...37904956322c
<GitHub> artiq/pdq2 3790495 Robert Jordens: pdq2: crc/frame register accessors
<GitHub> artiq/pdq2 1fa54e3 Robert Jordens: doc: pdq2 spi backend
rohitksingh_work has quit [Ping timeout: 240 seconds]
rohitksingh_work has joined #m-labs
rohitksingh_work has quit [Ping timeout: 260 seconds]
rohitksingh_work has joined #m-labs
sb0 has joined #m-labs
rohitksingh_work has quit [Ping timeout: 246 seconds]
rohitksingh_work has joined #m-labs
<sb0> bb-m-labs, force build --branch=release-2 artiq
<bb-m-labs> I'll give a shout when the build finishes
<bb-m-labs> build forced [ETA 44m49s]
sb0 has left #m-labs [#m-labs]
sb0 has joined #m-labs
<sb0> bb-m-labs, force build artiq
<bb-m-labs> The build has been queued, I'll give a shout when it starts
rohitksingh_wor1 has joined #m-labs
rohitksingh_work has quit [Ping timeout: 260 seconds]
<bb-m-labs> build #479 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/479
sb0 has quit [Quit: Leaving]
<bb-m-labs> build #443 of artiq-win64-test is complete: Warnings [warnings python_unittest] Build details are at http://buildbot.m-labs.hk/builders/artiq-win64-test/builds/443
<bb-m-labs> build #1415 of artiq is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1415
<bb-m-labs> build forced [ETA 36m26s]
<bb-m-labs> I'll give a shout when the build finishes
<bb-m-labs> build #480 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/480
sb0 has joined #m-labs
rohitksingh_wor1 has quit [Read error: Connection reset by peer]
<bb-m-labs> build #444 of artiq-win64-test is complete: Failure [failed python_unittest coverage_combine] Build details are at http://buildbot.m-labs.hk/builders/artiq-win64-test/builds/444
<bb-m-labs> build #1416 of artiq is complete: Failure [failed] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1416
<sb0> bb-m-labs, force build artiq
<bb-m-labs> build forced [ETA 36m26s]
<bb-m-labs> I'll give a shout when the build finishes
<whitequark> sb0: sure: https://hastebin.com/bujoqukima.py
<whitequark> it hangs on the 3rd iteration
<whitequark> always
<bb-m-labs> build #481 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/481
rohitksingh has joined #m-labs
<sb0> whitequark, why does it say "Interrupting a running kernel" here → http://buildbot.m-labs.hk/builders/artiq-win64-test/builds/445/steps/python_unittest/logs/stdio
<sb0> in test_dma_nested
<sb0> *Interrupted
<sb0> the message also appears when the test is run under linux, http://buildbot.m-labs.hk/builders/artiq/builds/1417/steps/python_unittest_2/logs/stdio
<whitequark> I don't know
<sb0> whitequark, well, this looks fishy. are you sure the bug is in the gateware?
<whitequark> actually, nevermind, I know
<whitequark> that's precisely because of the hang
<whitequark> anyway, yes, I'm sure; whether tests fail or pass is irrelevant, because it still happens with the minimized testcase above
<sb0> how does a hang make this message appear?
<sb0> in the minimized repro, do you need to put core_dma.record() into the loop for the bug to appear?
<whitequark> no
<bb-m-labs> build #445 of artiq-win64-test is complete: Failure [failed python_unittest coverage_combine] Build details are at http://buildbot.m-labs.hk/builders/artiq-win64-test/builds/445
<bb-m-labs> build #1417 of artiq is complete: Failure [failed] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1417
<whitequark> this also breaks
<sb0> whitequark, I still don't understand how the DMA core hanging produces the "interrupted" message
<whitequark> sb0: well, the entire kernel hangs
<sb0> whitequark, does this also break when a different RTIO channel (ttl) is used in each DMA sequence? I want to avoid the possibility of e.g. a mishandled RTIO sequence error
<sb0> whitequark, if the kernel hangs then why does @kernel return?
<whitequark> I don't know, it's probably something about the testbench
<sb0> hmm
FabM has quit [Ping timeout: 246 seconds]
<whitequark> I haven't tried different channels
<sb0> whitequark, I'm not convinced. this looks like a runtime bug.
<sb0> this is the first time the DMA code is called, it only calls record() (no gateware)
<sb0> and it prints this "interrupted a running kernel" message.
<whitequark> sb0: then explain why the minimized example fails.
<whitequark> including if I hack it to not involve any runtime code at all
<sb0> there can be several bugs
<whitequark> I'm not convinced this *is* a runtime bug
<whitequark> but sure, I can waste some time figuring out exactly why the testsuite fails like that
<sb0> you do this:
<sb0> @kernel
<sb0> def nested(self):
<sb0> with self.core_dma.record(self.trace_name):
<sb0> pass
<sb0> with self.core_dma.record(self.trace_name):
<sb0> and it prints "Interrupted a running kernel"
<sb0> well. let me check if this is actually DMA related, or due to the test_watchdog test run before it
<whitequark> [ 321446188us] ERROR(runtime::session): session aborted: watchdog expired
<whitequark> in the core log
<whitequark> and the "Interrupted a running kernel" message is displayed every time a session is not finished cleanly
<whitequark> regardless of how
<sb0> ok, so this is a minor and straightforward runtime bug
<whitequark> management interface*
<GitHub> [artiq] whitequark pushed 2 new commits to master: https://github.com/m-labs/artiq/compare/6caab4d10b13...b643847da55e
<GitHub> artiq/master b643847 whitequark: Fix a misleading message for non-clean kernel termination.
<GitHub> artiq/master 7eb368f whitequark: test: add DMA test that checks the analyzer trace.
<sb0> http://buildbot.m-labs.hk/builders/artiq/builds/1348/steps/python_unittest_2/logs/stdio also has this message on test_rpc_timing after test_watchdog, so it confirm it is completely unrelated to DMA
FabM has joined #m-labs
<bb-m-labs> build #482 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/482
<bb-m-labs> build #1418 of artiq is complete: Failure [failed python_unittest_2] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1418 blamelist: whitequark <whitequark@whitequark.org>
mumptai has joined #m-labs
rohitksingh has quit [Quit: Leaving.]
<GitHub> [artiq] whitequark commented on issue #655: Triage: the upstream still cannot agree on the desired semantics for a feature like this: https://reviews.llvm.org/D20116. I am now more convinced that the patch we have is, in fact, unsound. https://github.com/m-labs/artiq/issues/655#issuecomment-289902948
mumptai has quit [Quit: Verlassend]
<GitHub> [artiq] cjbe commented on issue #670: Using the nist_clock 2.2 package in dev on Win7 this problem is fixed!... https://github.com/m-labs/artiq/issues/670#issuecomment-289918524
<GitHub> [artiq] cjbe commented on issue #670: (Apart from the monkey patch for #506 not working) https://github.com/m-labs/artiq/issues/670#issuecomment-289920855
<GitHub> [artiq] cjbe opened pull request #693: monkey_patches: fix 3af29f7 (master...monkey-patch) https://github.com/m-labs/artiq/pull/693
Gurty has quit [Ping timeout: 264 seconds]
<GitHub> [artiq] klickverbot commented on issue #693: Trivial, merging. https://github.com/m-labs/artiq/pull/693#issuecomment-289932028
<GitHub> [artiq] klickverbot closed pull request #693: monkey_patches: fix 3af29f7 (master...monkey-patch) https://github.com/m-labs/artiq/pull/693
<GitHub> [artiq] r-srinivas opened issue #694: ImportError: cannot import name 'Keep' when installing artiq 2.2 from source https://github.com/m-labs/artiq/issues/694
<GitHub> [artiq] r-srinivas opened issue #695: Installing release 2 from source documentation https://github.com/m-labs/artiq/issues/695
<bb-m-labs> build #483 of artiq-board is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-board/builds/483
<bb-m-labs> build #1419 of artiq is complete: Failure [failed python_unittest_2] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/1419 blamelist: Chris Ballance <chris.ballance@physics.ox.ac.uk>