#m-labs on 2016-04-04 — irc logs at freenode.irclog.whitequark.org

2015-03-04 14:45 sb0 changed the topic of #m-labs to: ARTIQ, Migen, MiSoC, Mixxeo & other M-Labs projects :: fka #milkymist :: Logs http://irclog.whitequark.org/m-labs

03:12 _rht has joined #m-labs

03:37 klickverbot has quit [Ping timeout: 260 seconds]

04:04 <mithro> sb0: How should I go about adding support for using gcc with or1k in misoc?

04:05 <sb0> why do you need that?

04:05 <sb0> otherwise it's just replacing clang with gcc in the makefile

04:06 <mithro> sb0: Because I want to use the same toolchain on lm32 and or1k for the moment

04:07 <mithro> sb0: I just hacked up https://github.com/m-labs/misoc/blob/master/misoc/integration/cpu_interface.py#L6 for the moment

04:08 <sb0> well you cannot, gcc needs different compiler builds for different architecture

04:08 <sb0> so I'm not sure what this adds

04:08 <sb0> you'll need to compile another toolchain anyway

04:12 <mithro> sb0: yes, I've already done that bit - I have conda recipes for lm32 and or1k gcc which seem to work okay. I needed the gcc compiler for or1k to compile linux / rtems as that is what the openrisc guys are developing with anyway

04:18 <mithro> I was thinking that adding a command line flag to https://github.com/m-labs/misoc/blob/master/misoc/integration/builder.py which allowed you to specify which compiler you wanted (with the default being the same as now) would be the correct approach?

04:19 <mithro> sb0: my other thought was using environment variables to override the settings in cpu_interface.py

04:20 <mithro> but that felt more "hacky" ?

04:51 klickverbot has joined #m-labs

04:56 klickverbot has quit [Ping timeout: 264 seconds]

05:18 klickverbot has joined #m-labs

05:23 klickverbot has quit [Ping timeout: 248 seconds]

05:31 evilspirit has joined #m-labs

05:46 klickverbot has joined #m-labs

05:50 klickverbot has quit [Ping timeout: 244 seconds]

05:53 _rht has quit [Quit: Connection closed for inactivity]

06:10 evilspirit has quit [Ping timeout: 244 seconds]

06:13 klickverbot has joined #m-labs

06:17 klickverbot has quit [Ping timeout: 244 seconds]

06:40 klickverbot has joined #m-labs

06:44 klickverbot has quit [Ping timeout: 252 seconds]

07:07 klickverbot has joined #m-labs

07:11 klickverbot has quit [Ping timeout: 240 seconds]

07:34 klickverbot has joined #m-labs

07:39 klickverbot has quit [Ping timeout: 260 seconds]

08:01 klickverbot has joined #m-labs

08:06 klickverbot has quit [Ping timeout: 246 seconds]

08:25 evilspirit has joined #m-labs

08:40 klickverbot has joined #m-labs

09:53 ssk1328 has joined #m-labs

10:39 kuldeep has quit [Ping timeout: 260 seconds]

10:49 ssk1328 has quit [Quit: Page closed]

11:09 kuldeep has joined #m-labs

11:11 <GitHub147> [artiq] sbourdeauducq pushed 4 new commits to master: https://git.io/vVRXm

11:11 <GitHub147> artiq/master 6951613 Sebastien Bourdeauducq: protocols/pc_rpc: add get_local_host to clients

11:11 <GitHub147> artiq/master 059836c Sebastien Bourdeauducq: protocols/remote_exec: give access to controller_initial_namespace

11:11 <GitHub147> artiq/master 4ce00e3 Sebastien Bourdeauducq: protocols/remote_exec: add connect_global_rpc

11:26 <bb-m-labs> build #286 of artiq-kc705-nist_clock is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-kc705-nist_clock/builds/286

11:29 <bb-m-labs> build #545 of artiq is complete: Failure [failed python_unittest_1] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/545 blamelist: Sebastien Bourdeauducq <sb@m-labs.hk>

11:40 klickverbot has quit [Ping timeout: 276 seconds]

11:46 evilspirit has quit [Ping timeout: 268 seconds]

11:50 evilspirit has joined #m-labs

11:59 kuldeep has quit [Ping timeout: 276 seconds]

12:01 klickverbot has joined #m-labs

12:05 klickverbot has quit [Ping timeout: 246 seconds]

12:14 kuldeep has joined #m-labs

12:41 kuldeep has quit [Ping timeout: 268 seconds]

12:57 kuldeep has joined #m-labs

13:42 FelixVi has joined #m-labs

14:17 <GitHub162> [artiq] sbourdeauducq pushed 4 new commits to master: https://git.io/vV0Gy

14:17 <GitHub162> artiq/master f860548 Sebastien Bourdeauducq: protocols/pyon: minor cleanup

14:17 <GitHub162> artiq/master aa61c29 Sebastien Bourdeauducq: transfer Python builtin exceptions over pc_rpc and master/worker

14:17 <GitHub162> artiq/master 7453d85 Sebastien Bourdeauducq: GUI -> dashboard

14:18 <GitHub82> [artiq] sbourdeauducq pushed 1 new commit to release-1: https://git.io/vV0GH

14:18 <GitHub82> artiq/release-1 eba90c8 Sebastien Bourdeauducq: client: add --async option to scan-repository, recommend usage in git post-receive

14:28 FelixVi has quit [Remote host closed the connection]

14:29 <bb-m-labs> build #287 of artiq-kc705-nist_clock is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-kc705-nist_clock/builds/287

14:29 <whitequark> this took me entirely too long but I implemented optimal 64-bit subtraction

14:32 <bb-m-labs> build #546 of artiq is complete: Failure [failed python_unittest_1] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/546 blamelist: Sebastien Bourdeauducq <sb@m-labs.hk>

14:34 <whitequark> http://hastebin.com/asajujehed.avrasm

14:34 <whitequark> rjo ^

14:34 <whitequark> so, a 64-bit sub is a sub+xor+add-with-carry

14:40 <whitequark> you know, this is really stupid, because the l.addc opcode has a reserved bit and the ALU already has all the necessary combinatory logic for subtraction

14:40 <whitequark> they could have added l.subc but did not :/

14:42 <whitequark> could've also used the 0x38,0x1 ALU subrange

14:47 <whitequark> sb0: do you see any use for the MAC module?

14:47 <sb0> no

14:51 <whitequark> 64-bit multiplier?

14:51 <sb0> how does this work? is l.sub touching the carry flag?

14:51 <sb0> doc say it does not

14:52 <whitequark> huh? the doc says that it does

14:52 <whitequark> rD[31:0] ← rA[31:0] - rB[31:0]

14:52 <whitequark> SR[CY] ← carry (unsigned overflow)

14:52 <whitequark> SR[OV] ← signed overflow

14:52 <sb0> http://docs.huihoo.com/openrisc/openrisc1000-arch.pdf

14:53 <sb0> This isntruction does not change carry SR[CY] flag.

14:53 <whitequark> that's the old version of the architecture

14:53 <whitequark> opencores.org/websvn,filedetails?repname=openrisc&path=%2Fopenrisc%2Ftrunk%2Fdocs%2Fopenrisc-arch-1.1-rev0.pdf

14:53 <whitequark> this is the recent revision

14:53 <whitequark> this was revised in 2012

14:53 <sb0> ah yes

15:00 <whitequark> why... why does or1k have separate instructions for extending byte and half-word to register size?!

15:00 <whitequark> well, zero-extending at least, that's just a waste of opcode space, since they're all representible via l.andi

15:01 <whitequark> this is a bizarre architecture

15:02 <sb0> yes

15:09 <sb0> lm32 doesn't have such problems afaik...

15:10 <whitequark> so, about that

15:10 <rjo> whitequark: nice. but how do you teach this to llvm if you say it can't learn to do this?

15:10 <whitequark> with what I leanred while fixing OR1K in the last few days, I'm confident I can quickly implement a decent LM32 backend as well as upstream OR1K

15:10 <whitequark> I understand pretty much all the moving parts necessary for implementing a backend of this complexity now

15:10 <whitequark> rjo: with C++ code.

15:11 <whitequark> it has a SUBE instruction (sub-using-carry) and it has built-in legalization code that translates the 64-bit SUB into SUBE+SUBC

15:11 <whitequark> I lower SUBC to l.sub which does the right thing, and then manually lower SUBE to l.xor+l.addc

15:11 <rjo> by the way. soon there will be many 64 bit subtractions because of latency compensation.

15:11 <whitequark> you will be pleased with their speed, then.

15:12 <whitequark> (and I will be pleased that I didn't waste this time)

15:12 <sb0> whitequark, but then there will be libunwind and all

15:12 <whitequark> (well, not like it would have gone to waste anyway, with all the things I learned...)

15:12 <whitequark> sb0: what about libunwind?

15:12 <sb0> I don't trust it will be bug-free for lm32, if available at all

15:12 <whitequark> you do remember that libunwind wasn't available at all for OR1K?

15:13 <whitequark> OR1K had no exceptions, no DWARF, no debug information whatsoever

15:13 <whitequark> libunwind basically needs setcontext+getcontext and a little bit of boilerplate. and it was bug-free from the start, because that code is just too dumb to have bugs

15:14 <whitequark> there *were* a few bugs in the OR1K frame lowering code, but they would have manifested even without exceptions or DWARF, that just made them manifest earlier, and in easier to debug ways, for that matter

15:16 <sb0> didn't you use something from BSD?

15:16 <whitequark> nope

15:17 <sb0> I remember seeing some OR1K DWARF/unwind support from there

15:17 <whitequark> I have never even heard about that

15:22 evilspirit has quit [Ping timeout: 260 seconds]

15:22 <cr1901_modern> They probably reimplemented something due to licensing concerns and/or the GNU equivalent being crap

15:22 <whitequark> well they sure as hell used binutils, there's no alternative for or1k

15:24 <cr1901_modern> Fair. (Though tbh, I'm a little surprised a binutils alt never came to fruition.)

15:25 <whitequark> sure it did

15:25 <whitequark> LLVM has its own assembler since ages (because it's stupid to fork, serialize and deserialize just to emit machine code)

15:26 <whitequark> now LLVM has its own linker too, and it slowly gains all the loose parts ie ar dwarfdump objdump et cetera

15:28 * sb0 notices that QFileDialog with QFileDialog::DontUseNativeDialog also has table column layout issues

15:29 <cr1901_modern> I guess it's just slow to adopt then. I actually didn't know LLVM had an assembler. Presumably you can write one for any backend you want if motivated?

15:30 <sb0> whitequark, so you're motivated to port everything to lm32?

15:32 <whitequark> sb0: sure, why not? you're saying it provides concrete advantages, and I see that it's not a lot of work

15:32 <sb0> well the architecture is cleaner. but there are no user-visible advantages ...

15:32 <whitequark> we also need to decide something about upstreaming the backends. or1k, lm32, both

15:33 <whitequark> I'm tempted to try it with or1k because it's already there and in a good state, and see how painful it is

15:33 <sb0> on the other hand, a file selector that would not suck clearly would be a user-visible advantage

15:44 <sb0> the kde one is okay, but probably hell to integrate

15:44 <sb0> on windows and all

15:45 <whitequark> might not be that bad actually, but what's wrong with the system one?

15:47 <sb0> I want to customize it in two ways: 1) it should not be a dialog but a permanent part of the application window 2) large icons used as previews (rendered by my application)

15:47 <sb0> the system one supports neither

15:47 <whitequark> I don't think you should base it off the file selector at all, then

16:03 <GitHub95> [artiq] jordens pushed 1 new commit to master: https://git.io/vV0P5

16:03 <GitHub95> artiq/master d095d48 Robert Jordens: gui.models: style

16:11 sb0 has quit [Quit: Leaving]

16:14 <bb-m-labs> build #288 of artiq-kc705-nist_clock is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-kc705-nist_clock/builds/288

16:16 <bb-m-labs> build #547 of artiq is complete: Failure [failed] Build details are at http://buildbot.m-labs.hk/builders/artiq/builds/547 blamelist: Robert Jordens <rj@m-labs.hk>

16:32 <whitequark> mor1kx doesn't even bother to implement the extension instructions

16:32 evilspirit has joined #m-labs

16:32 <whitequark> well, four out of six

16:37 sb0 has joined #m-labs

16:48 evilspirit has quit [Ping timeout: 244 seconds]

17:08 <whitequark> rjo: sb0: wow.

17:09 <whitequark> the 64-bit addc changes have had a *massive* effect, far more than I have anticipated

17:09 <whitequark> specifically, PulseRateDDS is down to 20us

17:10 <whitequark> so... 10us per channel? that's actually better than what the Oxford group wants, isn't it?

17:21 <whitequark> uh

17:21 <whitequark> what

17:22 <whitequark> *enabling* addc while building the runtime makes the test faster, but *disabling* addc while building the kernel *also* makes the test faster?

17:22 <whitequark> a little bit, but it does

17:36 <whitequark> yeah, there's a pretty large amount of l.addic's in dds.o, and a few in rtio.o, i think most of them are dead though

17:36 <whitequark> i wonder what's up with addc slowing down the kernel though

17:47 key2 has joined #m-labs

18:10 <rjo> whitequark: it is what i suggested they can get with drtio. 10us is a useful number for a pulse (dds set and ttl pulse combined). but there will be overhead when actually doing them.

18:11 <whitequark> yes, if you change phase you will immediately have FP in the loop

18:11 <whitequark> (do you change phase?)

18:11 <rjo> all the time

18:12 <whitequark> or if you set phase mode to not continuous, there will be a bunch of 64-bit multiplications in dds_set

18:13 <rjo> not only that but als the overhead of retrieving the pulse data etc. this is not just repeating the same pulse over and over again.

18:13 <rjo> but we really need to leave that for later imho.

18:14 <rjo> now we should prioritize and say that 10us for repeating the sme pulse without phase tracking is good.

18:14 <whitequark> there's the 64-bit multiplier in the ISA but not in mor1kx...

18:15 <rjo> out of curiosity. how did that help for pulse rate ttl?

18:15 <rjo> a 64 bit multiplier would need to be either multi-cycle or bring down the clock speed a lot.

18:16 <whitequark> it didn't. the ttl pulse rate is 1484ns

18:16 <whitequark> pretty much what it was before I started messing with FP, LICM, etc

18:16 <whitequark> this, on the other hand, I actually expected

18:16 <rjo> hmm. there should be heavy 64 bit stuff in there as well.

18:18 <rjo> but also something for later.

18:28 _rht has joined #m-labs

18:41 <bb-m-labs> build #199 of artiq-pipistrello-nist_qc1 is complete: Success [build successful] Build details are at http://buildbot.m-labs.hk/builders/artiq-pipistrello-nist_qc1/builds/199

19:39 <whitequark> hrm, RCA is inconclusive for that addc slowdown, but probably register pressure

19:39 <whitequark> in any case it's 40ns

19:46 kuldeep has quit [Ping timeout: 248 seconds]

19:55 key2 has quit [Ping timeout: 244 seconds]

20:02 kuldeep has joined #m-labs

20:03 kuldeep has quit [Client Quit]

20:18 kuldeep has joined #m-labs

21:33 _rht has quit [Quit: Connection closed for inactivity]

22:04 klickverbot has joined #m-labs

22:08 klickverbot has quit [Ping timeout: 250 seconds]

22:32 <whitequark> sb0: from discussion on #llvm: "like, moving from OR1K to RISC-V would be like moving from a trash can fire to a larger, dumpster-sized fire"

22:38 klickverbot has joined #m-labs

22:41 <cr1901_modern> Surprised. I was under the impression that RISC-V was the most popular out of LM32,OR1K,and RISC-V. But then again, most popular != best.

22:42 <cr1901_modern> (I've been told that "one reason LM32 is ignored is that it's 32-bit only")

22:42 <cr1901_modern> although I seem to recall that data width is adjustable? *checks*

22:43 <whitequark> datapath width is not really the same as register width

22:56 klickverbot has quit [Quit: No Ping reply in 180 seconds.]

22:56 klickverbot has joined #m-labs

22:57 <cr1901_modern> Yea, I'm not sure where I was going with that in retrospect.

22:57 <cr1901_modern> sb0: Ping.

23:03 <whitequark> rjo: sb0: we can't use overflows in OR1K.

23:03 <whitequark> none of the OR1K shifts set overflow (or carry, for that matter) bits

23:03 <whitequark> LLVM will transform *2 into <<1 in instcombine (and do other similar things)

23:03 <whitequark> which is, of course, not only legal but desirable.

23:04 <whitequark> not only this will make code *much* slower but also I don't think that optimization can even *be* turned off, it's considered target-independent

23:14 <GitHub64> [conda-recipes] whitequark pushed 2 new commits to master: https://github.com/m-labs/conda-recipes/compare/90738936fa7c...87172c701297

23:14 <GitHub64> conda-recipes/master 004e9e4 whitequark: llvm-or1k: bump.

23:14 <GitHub64> conda-recipes/master 87172c7 whitequark: llvmlite-artiq: bump.

23:14 <whitequark> bb-m-labs: force build --props=package=llvm-or1k conda-all

23:14 <bb-m-labs> build forced [ETA 43m59s]

23:14 <bb-m-labs> I'll give a shout when the build finishes

23:21 sandeepkr has joined #m-labs

23:46 klickverbot has quit [Ping timeout: 244 seconds]