sb0 changed the topic of #m-labs to: ARTIQ, Migen, MiSoC, Mixxeo & other M-Labs projects :: fka #milkymist :: Logs http://irclog.whitequark.org/m-labs
ylamarre has quit [Quit: ylamarre]
<rjo>
sb0: when i tried different sys clock speeds, it didn't matter
<rjo>
there seem to be three possible results: 1) phase four finds ~1000 hold or setup violations and everything passes fine. 2) it finds ~30000 and hangs. 3) it finds ~30000 and fails.
<whitequark>
is it nondeterministic?
<rjo>
but maybe slower clock reduces the rate of failures/hangs. we should definitely parallelize the bitstream builds.
<rjo>
very much so.
<whitequark>
that's gross
<whitequark>
doesn't it have some kind of -srand switch?
<sb0>
The PIC sub-project aims to add support for position independent code (PIC) for the uClibc/Linux version of the GNU tool chain. This will require extensions to the ABI (which currently has no specification for PIC) and a rewrite of much of the tool chain.
<sb0>
doesn't inspire trust
<sb0>
is the LLVM code looking good?
<whitequark>
yes
<whitequark>
linker might be worse, I will check it a bit later
<whitequark>
the or1k backend is surprisingly well written overall, I expected worse
<whitequark>
we could actually upstream it with relatively few modifications
<whitequark>
sb0: oh, binutils are in the clear
<whitequark>
it supports not only PIC, but even TLS (!)
<whitequark>
(TLS often has spotty support because, unlike PIC, it requires OS/runtime support)
<whitequark>
disappointingly, the LLVM backend doesn't have TLS support
<whitequark>
sb0: one reason i'm interested in this, is that it would be good to have decent exception support
<whitequark>
not just "ValueError, in one of the two dozen places it could have been raised"
<whitequark>
but a file:line1:col1:line2:col2.
<sb0>
what does this have to do with PIC?
<whitequark>
string literals
<whitequark>
rodata
<whitequark>
needs linker changes
<whitequark>
easier to just load PT_LOAD and forget about trying to muck with sections.
<whitequark>
like pretty much every other dynamic linker in existence does
<sb0>
well, options are
<sb0>
1) add that one relocation type needed for rodata
<whitequark>
either works i guess
<sb0>
2) use PIC, risk bugs, remove some of the existing linker code which is good
<whitequark>
i'd still look at 2 though
<whitequark>
is there an or1k simulator?
<whitequark>
or do I have to wait until pipistrello arrives to test?
<whitequark>
... i suppose there wouldn't be a simulator with misoc
<sb0>
there is verilator-based simulation
<whitequark>
does it work?
<sb0>
yes. it works very well, though it is slow (~1MHz)
<whitequark>
not very impressive compared to rtl-level sim
<sb0>
there is some QEMU support for or1k as well
<sb0>
...lm32 has good QEMU, but crappy LLVM :(
<whitequark>
well, that can be easily fixed once I finish this all
<whitequark>
always wanted to write a decent LLVM backend
<whitequark>
sb0: oh, while I'm at it, I have this draft for new interleave transformation impl
<whitequark>
every function signature will get a "duration" field. duration(10ns) etc. duration will be calculated purely lexically, during the typechecking phase
<whitequark>
so, there will be no DCE, no constant propagation, and no inlining preceding that
<whitequark>
the only potential problem with this design I see is inability to do something like constant=10; delay(constant*ms)
<whitequark>
I don't know how common this is; if it is, this can be improved upon
<whitequark>
anyway, this information allows us to completely validate the possibility of interleaving during the typechecking phase
<whitequark>
then, after IR generation, functions will be inlined. as a bonus, inlining will only go as far as necessary, i.e. if the process a function is matched with finishes after the function does, it doesn't have to be expanded
<whitequark>
same about loop unrolling
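The purely lexical duration computation described above can be sketched in ordinary Python using the stdlib ast module (a hypothetical helper, not ARTIQ code): only literal delay() arguments are summed, and anything else makes the duration unknown.

```python
import ast

def lexical_duration(source):
    """Sum the literal arguments of delay() calls in a function body.

    Returns the total, or None when any delay argument is not a literal
    constant -- the "cannot compute statically" case, e.g. delay(constant*ms).
    """
    total = 0
    for node in ast.walk(ast.parse(source)):
        if (isinstance(node, ast.Call)
                and isinstance(node.func, ast.Name)
                and node.func.id == "delay"):
            arg = node.args[0]
            if isinstance(arg, ast.Constant) and isinstance(arg.value, (int, float)):
                total += arg.value
            else:
                return None  # non-literal delay: duration unknown lexically
    return total

print(lexical_duration("def f():\n delay(10)\n delay(20)"))  # 30
print(lexical_duration("def f():\n c = 10\n delay(c)"))      # None
```

Since no DCE, constant propagation, or inlining precedes this, the second example stays unknown even though a human can see it is 10 — which is the trade-off being discussed.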
<whitequark>
questions.
<whitequark>
1) how common would non-literal expressions to delay be? how hard would they be to compute statically, at most?
<whitequark>
2) I assume the loop unrolling is meant to be used on loops with statically known iteration count, like range(10)
<whitequark>
since arrays do not have statically known size now, it would not be possible to meaningfully unroll a loop over an array
<whitequark>
is this right?
<sb0>
there will be a few non-literal expressions to delay
<sb0>
a common case is scanning timing
<sb0>
and unrolling the scan loop isn't a good option as those can be large
<sb0>
note that in this case the scanned time may be passed as parameter to a function, e.g. ttl.pulse()
<sb0>
I think that loop unrolling and function inlining should be driven by interleave requirements...
<whitequark>
that's exactly what I'm saying
<whitequark>
if interleave can fit it without expansion, so be it
<sb0>
and e.g. ttl.pulse() cannot get a "duration" field because that duration depends on its parameters, so it would always get inlined. right?
<sb0>
well, not always, but when there is a parallel/sequential block to lower
<whitequark>
well, not quite
<sb0>
whereas functions that have constant time and a duration field could be called (as functions) after parallel/sequential lowering
<whitequark>
let's say you have a function with duration 100 and ten delays of 10, and another with duration 100 and two delays of 50
<whitequark>
so to interleave these, with known durations, you'll still have to inline
<whitequark>
similar case with loop unrolling
<whitequark>
if you cannot definitely compute a duration, it's more problematic, because what I'm trying to do is to get the duration fully known after the typechecking
<whitequark>
let me think about implementing that
<sb0>
yes, you cannot definitely compute a duration in some cases.
<whitequark>
how about this: a duration field could be a number, or it could be an expression, using just addition and multiplication, which would incorporate function parameters
<sb0>
the general case with dynamic durations requires coroutines, which we don't want because they are slow and/or complicated
<sb0>
but the compiler should implement as much as possible of those cases that don't need coroutines
<whitequark>
wait
<whitequark>
slow and/or complicated?
<whitequark>
I was going to ask later whether you all want generators, because there's very little additional work to implement them
<whitequark>
basically the only thing that changes is the function's environment will be allocated in the parent's stack frame instead of its own
<sb0>
context switching between coroutines is slow, yes
<whitequark>
and it gets a "state" internal variable and a switch statement that dispatches it on reentry
<whitequark>
it is as costly as a function call and one indirect jump
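The lowering sketched here — environment kept in the caller's frame, plus a "state" variable dispatched on reentry — can be illustrated in plain Python (a hypothetical hand-lowering, not the actual implementation):

```python
# A generator lowered by hand: the environment lives in the caller's frame
# (here: a dict passed in), and a "state" variable is dispatched on reentry.
def scan_lowered(env):
    # original generator:
    #   def scan(start, stop):
    #       i = start
    #       while i < stop:
    #           yield i
    #           i += 1
    if env["state"] == 0:        # first entry: run the prologue
        env["i"] = env["start"]
        env["state"] = 1
    if env["state"] == 1:        # resumed at the yield point
        if env["i"] < env["stop"]:
            value = env["i"]
            env["i"] += 1
            return value
        env["state"] = 2         # exhausted
    return None

env = {"state": 0, "start": 0, "stop": 3}
print([scan_lowered(env) for _ in range(4)])  # [0, 1, 2, None]
```

Each resume costs one call plus the state dispatch, which is the "function call and one indirect jump" figure above.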
<whitequark>
is that too much?
<sb0>
I'm not sure if there are any practical uses for them
<sb0>
well
<sb0>
maybe for implementing complex scanning actually
<whitequark>
I don't think these would be problematic. there's a slightly tricky case with nested coroutines, but apart from that, it's simple
<whitequark>
what *I* am not sure about, however
<whitequark>
is how coroutines help with interleaving
<whitequark>
well, you can rewrite delay(ns) to yield(ns) and basically make with parallel a scheduler
<sb0>
we want to be able to create iterators that scan over a range of values, possibly picking them at random or not with a constant interval between each point
<sb0>
generators may help with that
<sb0>
generators do not help with interleaving. and yes, "yield ns" is what I mean.
<whitequark>
I mean, generators == coroutines
<sb0>
yes
<whitequark>
ok
<whitequark>
let's not look at generators then, until we definitely need them
<whitequark>
08:44 < whitequark> how about this: a duration field could be a number, or it could be an expression, using just addition and multiplication, which would incorporate function parameters
<whitequark>
what do you think about this
<whitequark>
you could still always compute it from a signature. it is also composable.
<whitequark>
and you don't even have to inline
<sb0>
that won't work
<whitequark>
why?
<sb0>
you need to interleave the inside of functions. a simple case is with parallel: a.pulse(10*us) b.pulse(20*ns)
<whitequark>
sure
<whitequark>
let's say pulse is defined as:
<sb0>
that gets lowered to: a.on() b.on() delay(10*us) a.off() delay(10*us) b.off()
<whitequark>
then its signature will look like: (x=int; duration=(x+x))->None
<sb0>
why the final delay?
<whitequark>
illustrative purposes
<whitequark>
so if you want something like
<whitequark>
def pulsen(x, n): for _ in range(n): pulse(x)
<whitequark>
it will get duration=n*(x+x)
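The composable duration expressions above (duration=(x+x) for pulse, duration=n*(x+x) for pulsen) could be modelled as a tiny expression tree over numbers, parameters, addition, and multiplication — a hypothetical sketch, not ARTIQ's actual representation:

```python
# Hypothetical symbolic duration expressions: numbers, parameters, + and *.
class Dur:
    def __add__(self, other): return Add(self, other)
    def __mul__(self, other): return Mul(self, other)

class Num(Dur):
    def __init__(self, v): self.v = v
    def eval(self, env): return self.v

class Param(Dur):
    def __init__(self, name): self.name = name
    def eval(self, env): return env[self.name]

class Add(Dur):
    def __init__(self, a, b): self.a, self.b = a, b
    def eval(self, env): return self.a.eval(env) + self.b.eval(env)

class Mul(Dur):
    def __init__(self, a, b): self.a, self.b = a, b
    def eval(self, env): return self.a.eval(env) * self.b.eval(env)

x, n = Param("x"), Param("n")
pulse_dur = x + x            # pulse:  duration = x + x
pulsen_dur = n * pulse_dur   # pulsen: duration = n * (x + x), composed

print(pulse_dur.eval({"x": 10}))           # 20
print(pulsen_dur.eval({"x": 10, "n": 3}))  # 60
```

The point is compositionality: pulsen's duration is built from pulse's signature alone, with no inlining, and can be evaluated once the call-site arguments are known.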
<sb0>
ok, I understood
<whitequark>
the advantage over abstract interpretation is that this scheme is very transparent. you can inspect every piece and they will always get the exact same type regardless of context
<sb0>
but my main critique is this won't handle "with parallel: a.pulse(10*us) b.pulse(20*ns)"
<whitequark>
why not?
<sb0>
because you need to break down/inline pulse()
<sb0>
before interleaving
<whitequark>
why?
<sb0>
because the correct lowered result is "a.on() b.on() delay(10*us) a.off() delay(10*us) b.off()"
<whitequark>
sure
<sb0>
and you can't get that without looking into each statement of pulse()
<whitequark>
the interleaving transformation itself inlines
<whitequark>
but this happens after computing the duration
<whitequark>
so after the typechecking, you'll know that a.pulse(10) executes for 20us, and b.pulse(20) executes for 40
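The effect the interleave transformation is meant to achieve — merging two sequential event streams by absolute timestamp — can be sketched as follows (hypothetical representation: each branch is a list of (event, following-delay) pairs, not ARTIQ's IR):

```python
# Merge parallel branches into one timestamp-ordered event sequence.
def interleave(*branches):
    events = []
    for branch in branches:
        t = 0
        for name, dt in branch:
            events.append((t, name))
            t += dt  # advance the branch-local timeline
    # stable sort by timestamp interleaves the branches
    return [name for t, name in sorted(events, key=lambda e: e[0])]

a = [("a.on", 10), ("a.off", 0)]   # a.pulse(10)
b = [("b.on", 20), ("b.off", 0)]   # b.pulse(20)
print(interleave(a, b))  # ['a.on', 'b.on', 'a.off', 'b.off']
```

This reproduces the lowered ordering sb0 gives for the parallel pulse example, and it is this step that requires seeing inside each pulse() — hence inlining — regardless of whether the total durations were already known.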
<sb0>
so why bother with computing durations? reducing the amount of functions that end up inlined?
<whitequark>
error reporting. with the scheme I am proposing, the computation of duration is completely local
<whitequark>
the computed duration, or impossibility of computing one, depends only on the lexical content of the function
<whitequark>
which makes it easy to explain why it was not possible to do so
<whitequark>
whereas if you inline three levels deep, how are you going to map your error back to your original code?
<whitequark>
less inlining is a minor bonus
<whitequark>
not to mention this is quicker to implement than abstract interpretation, because I don't need DCE, SCCP, etc
<sb0>
you also have to deal with at_mu()
<whitequark>
what does that do?
<sb0>
set now() to an absolute timestamp. that will throw off your duration computation...
<sb0>
a common use case is:
<sb0>
t = signal_input.timestamp_mu(); at_mu(t); delay(...); signal_output.do_something(...)
<whitequark>
can you even interleave waiting at at_mu() with delay()s?
<whitequark>
I don't see how that can be done statically
<sb0>
no, this obviously cannot be interleaved
<whitequark>
then why is this a problem? the type-level duration is only used for interleaving
<whitequark>
if you try to interleave a function containing that, it will point at at_mu and become angry
<sb0>
so it would have a "complicated duration" flag?
<whitequark>
basically, with a diagnostic hidden inside
<whitequark>
the diagnostic will not be shown except if something requires interleaving
<sb0>
same as if the duration would not be a polynomial expression of function params and constants?
<whitequark>
yes
<whitequark>
well, different message, obviously
<whitequark>
but same idea
<sb0>
so this "duration" flag is used solely for getting better error messages. ok, good.
<whitequark>
it will also be used to decide whether you can avoid inlining
<whitequark>
since if you interleave a function which takes 10us, no matter how many delays are inside, with a 20us delay, there's no point in that.
<whitequark>
but otherwise, yes.
<sb0>
the only drawback I see is this will fail to interleave those cases when the duration expression is non polynomial, but which could still be resolved by constant propagation/DCE
<whitequark>
I actually consider this a feature
<sb0>
eg. if x > 5: delay(5*us) else: delay(10*us)
<whitequark>
because if you do this, you introduce global dependencies all across your program, and when you refactor it, it will break in contrived ways
<whitequark>
that are, which is the motivating part, not at all expressible with sane error messages
<whitequark>
I mean, the best you can do is to print the entire history of abstract computation you performed that led to the failing case
<whitequark>
which is not very helpful and is also a lot of work to actually display
<whitequark>
if really desired, you can bring additional clauses into the inferred duration expression. even the if expression above, why not
<whitequark>
so if there is some common non-polynomial thing we need to support, it can be done
<whitequark>
duration(5 if x > 5 else 10)
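If conditional clauses were admitted into the inferred duration expression, the sketch could carry them as an extra node kind — again a hypothetical illustration, modelling duration(5 if x > 5 else 10):

```python
# Minimal pieces of a hypothetical duration-expression tree.
class Num:
    def __init__(self, v): self.v = v
    def eval(self, env): return self.v

class Cond:
    """A conditional duration clause: then if pred(env) else other."""
    def __init__(self, pred, then, other):
        self.pred, self.then, self.other = pred, then, other
    def eval(self, env):
        return self.then.eval(env) if self.pred(env) else self.other.eval(env)

# duration(5 if x > 5 else 10)
d = Cond(lambda env: env["x"] > 5, Num(5), Num(10))
print(d.eval({"x": 7}))  # 5
print(d.eval({"x": 3}))  # 10
```

The duration stays a local, inspectable property of the signature; it just gains a branch instead of being recovered by global constant propagation.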
<sb0>
yes, but this support is already there in CP/DCE
<sb0>
or are you considering leaving CP/DCE to LLVM entirely?
<whitequark>
absolutely
<whitequark>
it has enough type information with the design I have. it will do as good a job as I can
<whitequark>
(probably even better, seeing as it has passes like scalar evolution)
<mithro>
how do I configure the speed of the uart in misoc MiniSoC?
<ysionneau>
you can play with uart_baudrate parameter of SoC Module
<ysionneau>
something like -Ot uart_baudrate <value> when using make.py
<mithro>
thanks!
<mithro>
ysionneau: is there an easy way to reach into the self.submodules.uart_phy and map the output value to a second pin?
<mithro>
sb0: and how do I map that to an IO pin? I would have thought self.comb += [platform.lookup_request("debug").eq(platform.lookup_request("serial").tx)] would work - but it can't find the "debug" specifier in the IOs?
<mithro>
oh, that needs to be
<mithro>
self.comb += [platform.request("debug").eq(platform.lookup_request("serial").tx)] it seems
<mithro>
okay, I'm now at the stage that I can see the UART output from misoc and the UART output from the USB-UART - but still no data is going between them...
<mithro>
timing all looks fine....
FabM has quit [Quit: ChatZilla 0.9.91.1 [Firefox 39.0/20150630154324]]
<mithro>
well, the misoc side can receive the data and echo it back, so it just looks like the path from the USB-UART up to the computer that is borked...
<sb0>
lookup_request is for fetching a io pin that has already been requested before
<olofk>
Just read through the back logs.
<olofk>
When it comes to hold time violations in ISE, in my experience these are almost always caused by unhandled CDCs in other parts of the design that make the router try too hard to meet unnecessary constraints
<olofk>
I would recommend taking a look at all CDCs and handling them individually. That will probably fix the hold time violations
<olofk>
Also, the golden reference or1k simulator is or1ksim. It's a C model that's pretty fast
<olofk>
And I think the info on the PIC stuff is out of date. Not entirely sure, but I believe that information is now fixed in the arch spec
<GitHub3>
[artiq] whitequark pushed 7 new commits to new-py2llvm: http://git.io/vmEjt
<GitHub3>
artiq/new-py2llvm 53fb03d whitequark: Restrict comprehensions to single for and no if clauses.
<GitHub3>
artiq/new-py2llvm e9416f4 whitequark: Convert Slice into typed SliceT.
<GitHub3>
artiq/new-py2llvm 5000f87 whitequark: Rename the field of CoerceT from expr to value.
<whitequark>
does or1ksim have any IO though?
<whitequark>
how is that handled?
<GitHub162>
[artiq] whitequark pushed 1 new commit to new-py2llvm: http://git.io/vmuOZ
<GitHub162>
artiq/new-py2llvm 5756cfc whitequark: Correctly infer type of list(iterable).
ylamarre has joined #m-labs
ylamarre has quit [Client Quit]
<GitHub158>
[artiq] whitequark pushed 1 new commit to new-py2llvm: http://git.io/vmu4E
<GitHub158>
artiq/new-py2llvm bcd1832 whitequark: Ensure bindings are created in correct order for e.g. "x, y = y, x".
ylamarre has joined #m-labs
<sb0>
in Qt, retrieving the text of a QLineEdit: widget.text(). of a QComboBox: widget.currentText()
<whitequark>
QLineEdit is a kind of label, whereas QComboBox isn't
<whitequark>
*why* QLineEdit is a kind of label is beyond me
<whitequark>
hm, there's some awkward interaction between exception handling and allocation
<whitequark>
you can't restore the stack pointer blindly when longjmp'ing
<whitequark>
e.g. def f(): try: x = [1]; raise E; except E: x[0] # segfault
<whitequark>
so I would essentially have to update the stored stack pointer in the last jmpbuf before every call or raise
<whitequark>
... which is *exactly* what SjLjEHPrepare was designed to do. but alas
<whitequark>
by the way, if you wanted a reason as to why the intrinsics were kind of weird, this is why
<sb0>
isn't the stack pointer offset once and for all in the function prologue?
<whitequark>
nope
<whitequark>
since I'm using stack for dynamic allocation
<whitequark>
this will continually advance the stack pointer down. the spill slots and locals will be addressed as frame-point-relative though
<sb0>
yeah, sure
<whitequark>
x = [1] is an alloca inside
<sb0>
so you are implementing dynamic allocation, e.g. [0]*some_complicated_algo() is valid code?
<whitequark>
sure
<whitequark>
it doesn't really matter that lists can be dynamically sized, because even if they were statically sized, you could put them one inside another
<whitequark>
well
<whitequark>
basically, allocate and let them escape the immediate vicinity of allocation
<whitequark>
so everything that you allocate needs to stay alive until the function finally returns
<sb0>
I see
<sb0>
what do you propose? implement sjlj intrinsics into the or1k backend?
<whitequark>
no
<whitequark>
well, actually, I'm not sure, maybe yes
<whitequark>
I should look what is easier, adding the intrinsics or implementing that functionality myself
<whitequark>
sb0: are you *sure* you don't want to use libunwind?
<whitequark>
that will give us backtraces and EH support with pretty much no development cost
<whitequark>
LLVM already generates suitable DWARF and there is an OR1K port
<whitequark>
("porting" libunwind consists of adding two assembly stubs to save/restore all registers)
<whitequark>
raising from C is calling _Unwind_RaiseException with the right arguments
<whitequark>
catching unhandled exceptions from C is wrapping _Unwind_RaiseException; it will tell you when there are no suitable handlers
<sb0>
well, that dynamic stack pointer is a pretty serious problem, so I guess the options are a) hack jmpbufs b) intrinsics c) libunwind
<whitequark>
yes
<sb0>
a) sounds pretty bad
<sb0>
what are the pros and cons of b vs. c
<whitequark>
b: more development time spent on backend, then again when lm32 support is needed. however, simpler runtime behavior
<whitequark>
c: virtually zero development time (dwarf output requires no target-specific code and libunwind porting is trivial), more complex runtime (includes libunwind, reads DWARF tables)
<whitequark>
c is also zero-cost on fast path
<whitequark>
given that basically everything can raise (IndexError, ValueError, etc) there *might* be some usefulness to that
<whitequark>
but also might not
<sb0>
does libunwind deal with reading the DWARF tables?
<sb0>
how much glue is needed?
<whitequark>
that's pretty much its purpose
<whitequark>
very little. most of the annoying parts are generated by LLVM itself
<whitequark>
you need to set up a specific landing pad structure, provide a personality routine, and call _Unwind_RaiseException
<whitequark>
and put your unhandled exn handler into whatever place _cxa_terminate is placed by clang
<ysionneau>
cr1901: for desktop related questions about NetBSD you can ask around on #EdgeBSD , khorben is maintaining his own desktop environement for NetBSD called DeforaOS
<ysionneau>
he has well thought about a lot of desktop related stuff
<cr1901>
Alright- so they'll accept plain vanilla Net q's then?
<ysionneau>
yes, absolutely
<ysionneau>
ah, there is a uefi variable you can change to access more functions ... nice
<sb0>
nice? no.
<ysionneau>
well, not nice, but good that it at least exists, even if it should be available directly
<sb0>
also, my bios uses different format
<ysionneau>
a konami code would have been more fun than having to edit with a hex editor though ...
<ysionneau>
ah so you cannot tweak the same exact byte?
<cr1901>
Awesome, good to know others have given it thought.
<cr1901>
I'm sure Free is great too. I just had better, more fun experiences running Net on stuff (bucket list would be a 68k machine).
<sb0>
if those "others" were lenovo intel, _then_ it would be good