#qi-hardware on 2011-08-12 — irc logs at freenode.irclog.whitequark.org

00:01 <qi-bot> [commit] Bart van Strien: Add libunistring (master) http://qi-hw.com/p/openwrt-packages/bb2a84c

00:01 <bartbes> that took way too long

00:03 <wolfspraul> all things worth doing took longer than they should have

00:53 <wolfspraul> xiangfu: reading backlog about flash

00:53 <wolfspraul> so by power cycling, you managed to flip a bit in your nor flash?

00:54 <xiangfu> maybe I am not sure.

00:54 <wolfspraul> unfortunately the rc2 and rc3 boards are different in that area, that complicates our search

00:54 <xiangfu> werner replied in mailing list. If this is really a NOR corruption and if it is caused by incorrect

00:54 <xiangfu> sequencing of power supplies, perhaps power-cycling (instead of the

00:54 <xiangfu> reset button) could make it happen more often.

00:55 <xiangfu> I don't know how to narrow down the bug. needs help from Werner and Adam :)

00:55 <wolfspraul> what is the reset button? werner means reset in the gui?

00:55 <xiangfu> I think he means press all three buttons for reboot.

00:57 <xiangfu> I am do "replug the dc adapter male plug" for power-cycling

02:32 <wpwrak> xiangfu: (reset button) err yes, whatever button press makes the M1 reset ;-)

02:34 <wolfspraul> ok so we have 3 ways to reset:

02:34 <wpwrak> for now, i'd suggest to establish how often this happens. i.e., with a fixed reset procedure, repeat until hitting maybe 10 or more corruption events. (after each, reflash)

02:34 <wolfspraul> 1) cold removal of power supply, either by removing DC jack or by unplugging/switching off the power behind the adapter

02:34 <wolfspraul> 2) press three buttons at once for reset

02:34 <wolfspraul> 3) power off (or reset) in the gui

02:34 <wolfspraul> so actually it's 5 ways

02:34 <wolfspraul> 1) unplug DC jack

02:35 <wolfspraul> 2) unplug power supply itself from mains

02:35 <wolfspraul> 3) press three buttons

02:35 <wolfspraul> 4) 'reset' in gui

02:35 <wolfspraul> 5) 'power off' in gui

02:35 <qi-bot> The build has FAILED, see log here: http://fidelio.qi-hardware.com/~xiangfu/compile-log/openwrt-xburst.full_system-08112011-0046/

02:37 <kristianpaul> bartbes: seems you managed to solve it :) sorry i wasnt able to help. actually now that there is a openwrt-milkymist port i think i'll care more about later about learn about makefiles and packaging

02:43 <wolfspraul> wpwrak: 10 corruption events!!! :-)

02:43 <wolfspraul> you will drive everybody to heart attack. This event is relatively rare, don't forget.

02:43 <wolfspraul> if adam sets 38 boards to 'available', that's 380 full power to render cycles without any such occurance.

02:43 <wolfspraul> and maybe 3-5 boards did show the problem in between

02:43 <wolfspraul> so maybe 1-2 times per 100 power cycles

02:43 <wolfspraul> if you look at it that way

02:43 <wolfspraul> but right now we know too little

02:43 <wolfspraul> not just 'power cycle', actually it's a full rendering cycle with boot-to-render and then let it render for 30 seconds

02:43 <wpwrak> i think we could try and see if unplugging the DC jack does the trick. unplugging mains would be even nastier, but it's also less controllable

02:43 <wpwrak> xiangfu: btw, after flashing, are the partitions write-protected again ?

02:44 <wpwrak> (10 events) yeah, but otherwise you don't know how long you have to test to be reasonably sure you've nailed it :)

02:45 <wpwrak> 100 events would of course be better ;-)

02:47 <wpwrak> the good news: unless adam always runs the tests and checks the test logs, he may see a much lower than real incidence, because he'll probably only notice bitstream corruptions

02:47 <xiangfu> wpwrak, (write protected) hmm... I use urjtag flash them. there is now write-protect like command in urjtag

02:48 <xiangfu> wpwrak, the flash output is like unlock... flashing... unlock... flashing...

02:48 <wpwrak> ;-)

02:49 <xiangfu> wpwrak, how this write-protect works?

02:49 <wpwrak> lemme check ... there are many different ways how NOR chips do this

02:49 <aw_> 0x7F histories: 1. After reflash, D2/D3 is dimly lit by used 1.8M usb cable. 2. http://downloads.qi-hardware.com/hardware/milkymist_one/production/rc3/test_results/7F-reflash-results 3. can reconfigure after powered on since replaced u7/u19/u20 and couple days late 4. tested image pass

02:50 <aw_> please don't ask me do some tests now....Â Â i just posted here if i found interesting. ;-)

02:52 <wolfspraul> aw_: what's interesting about 0x7F ?

02:52 <wolfspraul> 0x7F passes all tests now?

02:52 <wpwrak> http://www.micron.com/get-document/?documentId=6062&file=319942_J3_65_256M_MLC_DS.pdf

02:53 <wpwrak> page 28. "Configurable Block Locking" ... " For additional information and collateral request, please contact your filed representative ." haha, very funny

02:53 <aw_> the one was d2/d3 dimly lit after used 1.8 M ago. then today I powered on and d2 is fully off so that I started to test it. ;-) surely 0x7f was replaced new u7/u19/u20. ;-)

02:55 <aw_> now it's all passed except 8:10 which i forgot to insert card. ;-) do rendering next. :)

02:55 <wolfspraul> ok

02:55 <wolfspraul> not sure whether that's related to the flash problems

02:55 <wpwrak> xiangfu: however, section 10.1 (on the same page) might be useful. if the blocks are left unlocked, that's an invitation for trouble. you probably still have partitions where writes are expected and don't want to lock these. but at least the bitstream and maybe also the rescue stuff should probably default to being locked

02:55 <wolfspraul> but that's why you go through the entire batch now, so that we can then see the flash problem more clearly

02:56 <kristianpaul> (locking) good, now i can finally call that nor chip rom :)

02:57 <wolfspraul> xiangfu: when your board couldn't boot anymore because standby was corrupted (maybe), was the rescue boot still working?

02:57 <wpwrak> kristianpaul: you could, if you didn't need to store data there. when you download patches, where do they go ? i'd guess to the NOR, right ?

02:58 <xiangfu> wpwrak, thanks. so there are some addition area for store the block lock info?

02:58 <kristianpaul> wpwrak: actually sebastien idea is that, for a writable FSÂ Â memory card have a job :)

02:58 <aw_> hmm...0x7f: bad that can't configure after powered-cyle. :(

02:58 <kristianpaul> wpwrak: pathces can be loaded from memcard

02:58 <xiangfu> wpwrak, (rescue stuff shoudl be locked) yes.

02:59 <kristianpaul> wpwrak: store, i think user and pass for ftp, dunno what else i lost the track to flickerinose

02:59 <wolfspraul> aw_: wait

02:59 <wolfspraul> so you just reflashed 0x7f, successfully (according to flash script)

02:59 <wolfspraul> then you ran the test software that was loaded over serial, and it succeeded

03:00 <aw_> wolfspraul, no reflash

03:00 <wpwrak> section 10.4 is funny. a "password access", some 64 bits string. as if that accomplished much ;-))

03:00 <aw_> yes

03:00 <wolfspraul> now you try to power cycle but it won't boot?

03:00 <wolfspraul> how did you power cycle?

03:00 <wolfspraul> and how does it stop? d2/d3 dimly lit?

03:00 <aw_> it successed. so I inserted 8:10 card then powered on ...and d2 is dimly lit though. :(

03:00 <kristianpaul> wpwrak: (password) so murphy cant guess that one :)

03:01 <aw_> i unplugged the dc jack's plug

03:01 <aw_> and replugged it

03:01 <wolfspraul> is 'd2 dimly lit' the same as the nor flash corruption or a separate bug?

03:01 <wpwrak> kristianpaul: naw, i think it's meant as a means to protect confidential content. now, how hard would it to record that bit string ? :)

03:02 <kristianpaul> ahh

03:02 <kristianpaul> shame

03:02 <kristianpaul> hum yes

03:02 <aw_> seems once it had have d2/d3 dimly litÂ Â after first flash, it will be easily show d2 dimly lit once power on

03:03 <wpwrak> xiangfu: (lock) in the NOR data sheet, sections 6.1 and 6.2. it should be very similar to the unlocking operation

03:03 <aw_> well..records firstly then continue other boards.

03:04 <wolfspraul> yeah

03:04 <wolfspraul> wasn't the reset circuit meant to fix the 'd2 dimly lit' problem?

03:04 <wpwrak> wasn't it reset -> fix NOR corruption and NOR corruption -> "d2 dimly lit" ?

03:05 <wolfspraul> unfortunately that's what we will only really learn and understand right now in the middle of the rc3 run ;-)

03:05 <wpwrak> so i think we have some evidence that the reset circuit in its present state isn't sufficient to make NOR corruption go away

03:06 <wolfspraul> hmm

03:06 <wolfspraul> the 'd2 dimly lit' problem I knew about went after after power cycling again

03:06 <wpwrak> (evidence) at least xiangfu's M1rc2 seems to suffer real NOR corruption. we haven't properly established that on an M1rc3 yet, though

03:07 <wolfspraul> aw_: can you try to power cycle 0x7F three times?

03:07 <wpwrak> ah, interesting. thought that was also the symptom of bad NOR

03:07 <aw_> wolfspraul, ok

03:07 <wolfspraul> many m1 rc2 users have found workarounds for power cycle/boot problems for themselves

03:08 <wolfspraul> that complicates our analysis now

03:08 <xiangfu> wpwrak, I only find the unlock code in urjtag. not found the lock code.

03:09 <xiangfu> wpwrak, let me find the source code url.

03:09 <aw_> wolfspraul, all d2 dimly lit now in three times powered-cycle.

03:09 <wolfspraul> the data they report is biased because of their workarounds. so we need to try to remove that now.

03:09 <wolfspraul> aw_: he :-) try another 3 times, wait 5 seconds in between.

03:09 <wpwrak> xiangfu: check the data sheet. it describes the lock/unlock process. once you've found the same bytes for the unlock, it should be easy to do the locking

03:09 <wpwrak> xiangfu: http://www.micron.com/get-document/?documentId=6062&file=319942_J3_65_256M_MLC_DS.pdf

03:10 <aw_> wolfspraul, already stays 5 seconds in between.

03:10 <wpwrak> xiangfu: pages 18 to 20

03:10 <wolfspraul> can you lock each page separately?

03:10 <wolfspraul> and the locking is stored in the nor as well?

03:10 <wpwrak> wolfspraul: correct (2x)

03:10 <wolfspraul> aw_: yes just try another 3 times, to be sure

03:11 <wolfspraul> if it is so persistent, it's definitely not the 'd2 dimly lit' problem I know from my rc2

03:11 <wpwrak> wolfspraul: well, each block. NOT doesn't have pages :)

03:11 <wpwrak> s/NOT/NOR/

03:11 <wolfspraul> and our partitions end in full blocks?

03:11 <wpwrak> xiangfu ?

03:12 <xiangfu> wpwrak, yes. end in full blocks

03:13 <xiangfu> wpwrak, here is the code for reflash m1: http://urjtag.git.sourceforge.net/git/gitweb.cgi?p=urjtag/urjtag;a=blob;f=urjtag/src/flash/intel.c;h=6e13a4363f73a29a4130c9358b0e2754fcfd2678;hb=HEAD

03:14 <aw_> wolfspraul, still d2/d3 dimly lit, wait at least 5 seconds in between. i felt it keeps this stage unless I put aside it long long time even day. ;-)

03:14 <wolfspraul> hmm

03:14 <wolfspraul> ok

03:14 <wolfspraul> move it aside

03:14 <wolfspraul> :-)

03:14 <wolfspraul> a proper (!) flash corruption will not fix itself

03:14 <wolfspraul> :-)

03:15 <wolfspraul> so it should not come back even after several days, which it sometimes does

03:15 <wolfspraul> so I think there are several different problems here, masking each other partially

03:15 <wolfspraul> aw_: just continue with the full round, then we look at all data carefully

03:16 <aw_> wolfspraul, yes

03:16 <wolfspraul> ok so right now, we are not locking anything in the nor flash

03:16 <wpwrak> xiangfu: you can probably just copy the unlock functions, change UNLOCK_BLOCK to LOCK_BLOCK, and you're done

03:16 <wolfspraul> but werner proposes to look several parts like rescue bitstream and maybe more

03:16 <kristianpaul> (fix itself) so bus corruption? fpga..Â Â a 20Ghz scope near? ;)

03:16 <wpwrak> xiangfu: well, plus calling them ;-)

03:17 <wolfspraul> s/look/lock/

03:17 <wolfspraul> if we lock anything, my thoughts would be: a) how does that impact the ability for web updates or other updates

03:18 <wpwrak> is the NOR mapped in the LM32's memory address space ?

03:18 <wolfspraul> b) are we just covering up the real bug behind a lock (even if effective), or is this a proper fix still?

03:18 <wolfspraul> just my thoughts, nothing else

03:18 <wpwrak> just an extra protection

03:18 <wpwrak> so you shouldn't set the locks when hunting the corruption

03:19 <wolfspraul> and the locks need to be removed for updates

03:19 <kristianpaul> wpwrak: yes is mapped

03:19 <wpwrak> then not locking those things borders on insanity ;-)

03:20 <wolfspraul> I don't think the "power-to-render cycles leading to unreconfigurable board" is related to anything the fpga does during rendering

03:20 <wpwrak> considering that there's not even an MMU. any little sw bug can corrupt your NOR :)

03:20 <wolfspraul> that's because we see this problem regularly when doing sets of 10 power cycles with 30 second rendering sprints

03:20 <kristianpaul> ;)

03:21 <wolfspraul> but I never once have heard from it after a multi-hour rendering

03:21 <wolfspraul> that's a very weak logic, but still

03:21 <wolfspraul> it could be that long renderings are rare, and we are not focusing enough on this problem

03:21 <wolfspraul> that's not to say that an unlocked memory mapped NOR is insane

03:22 <wpwrak> wolfspraul: (ability to update) the update process would have to unlock before writing, then lock again. should be no problem.

03:22 <wolfspraul> welcome to m1 :-)

03:22 <wolfspraul> yes

03:22 <wolfspraul> but that needs to be added

03:22 <wolfspraul> there are two risks in start selling rc3 now, basically signing off boards to leave Taipei

03:22 <kristianpaul> wpwrak: okay that another reason for a MMU, i think now i get more sense to me have one :)

03:22 <wolfspraul> the first risk is that the hardware is physically in a state that requires a fix later (a hardware fix)

03:23 <wolfspraul> the second risk is that it is a software problem only, but the board is driven into a state where a normal user cannot recover it anymore, leading to them potentially having to ship units around the world for unbricking

03:24 <wpwrak> (corrupting NOR via writes) i think it's a little more difficult than just doing a single bus cycle, but with protection off and all that, you're a lot closer to being able to inflict mayhem than you want to be

03:25 <wpwrak> wolfspraul: yes, you could try and see if locking properly protects the rescue partitions. that would at least allow recovery without usb-jtag. but without solving the origin of the corruption (which may be hw), M1s would still see corruption

03:25 <wpwrak> just in recoverable partitions. e.g., the one where all the patches for your show tonight are :)

03:25 <wolfspraul> that's exactly how I see it too

03:26 <wolfspraul> a lot of work ;-)

03:26 <wolfspraul> oh the units are not 'production ready' in its current state

03:27 <wpwrak> the evidence pointing to power cycling being a factor is strong. particularly given that people who rarely power cycle but reset often don't seem to experience NOR corruption easily

03:27 <wolfspraul> since the normal (web) update does not update the rescue stuff, it would be a relatively easy next step to lock all rescue partitions

03:27 <wolfspraul> xiangfu: so maybe you can try to get the locking done, and the we regularly lock all rescue partitions, as the normal process of reflash_m1.sh ?

03:28 <wolfspraul> I don't see the downside to that right now

03:28 <xiangfu> wolfspraul, ye. lock all rescue part partitions should be ok.

03:29 <wolfspraul> how many partitions is that?

03:29 <wolfspraul> I still don't have a mental map of all our partitions

03:29 <wpwrak> to corrupt the NOR, this should do nicely: volatile uint32_t *p = (void *) 0xSOMEWHERE; *p = 0x40; *p = 0;

03:29 <xiangfu> wpwrak, how can I test if the lock is correct. write some thing to this area then readback. will different. right?

03:29 <wolfspraul> test?

03:29 <wolfspraul> just lock then be happy

03:29 <wolfspraul> :-)

03:30 <xiangfu> i mean make sure the code is do lock correct.

03:30 <wolfspraul> sure, I was joking

03:30 <wpwrak> you could use the code snippet from above. if the lock works, then it won't be able to zero the word in question. else, ... :)

03:30 <xiangfu> wolfspraul, http://www.milkymist.org/wiki/index.php?title=Flashing_the_Milkymist_One#Flash_Memory_Distribution

03:30 <xiangfu> wolfspraul, all rescue + standby. so 5 partitions

03:32 <wolfspraul> xiangfu: the standby bitstream is needed by the rescue boot path?

03:32 <xiangfu> it will goto standby after you plug the power.

03:33 <wolfspraul> so it's always needed, even in rescue mode?

03:33 <xiangfu> yes.

03:33 <wolfspraul> is the standby bitstream updated by the web update?

03:33 <xiangfu> no

03:34 <wolfspraul> then it should probably be locked as well

03:34 <wpwrak> xiangfu: it seems that you read the lock bit with the Read Device Information command. that command changes the way the NOR behaves. reads then return status information, not the NOR data.

03:34 <xiangfu> if I understand correct. when plug power. fpga will load standby immediately, for enable the power button. reboot button. etc.

03:34 <wpwrak> xiangfu: then you can retrieve the lock bit, see page 22, table 9

03:34 <wolfspraul> ah yes, you said that already

03:34 <wolfspraul> in total lock 5 partitions

03:35 <wolfspraul> btw, the single-bit corruption (if it was one) xiangfu saw is not likely caused by a simple software pointer problem. that would have been much more likely to overwrite an entire word or more

03:37 <wpwrak> probably lock more than 5. the regular bitstream for sure. then, does FN normally need to write to BIOS, splash, APP ? or only to Data ?

03:38 <wpwrak> i don't know how often you can lock/unlock. probably not more often than you can write a regular NOR cell. so locking/unlocking should roughly follow the frequency of program cycles of the respective block.

03:39 <wpwrak> you may want to ask numonyx for clarification, though

03:39 <xiangfu> wpwrak, (read device Information command) yes.

03:39 <wpwrak> wolfspraul: 0 is a very common word value :)

03:40 <wolfspraul> but only 1 bit was changed

03:40 <wpwrak> wolfspraul: remember that the transition was 0x1000 -> 0x0000

03:40 <wpwrak> wolfspraul: yes, there was only one "1" bit there to destroy :)

03:40 <xiangfu> more diff: http://dpaste.com/592480/

03:40 <wolfspraul> if the standby bitstream is corrupted in random ways (offsets), then whether D2/D3 stay fully off, or dimly lit, may be just a coincidence and caused by the same root problem

03:41 <wolfspraul> hmm, true

03:42 <wpwrak> xiangfu: hmm, does that mean that everything after 0078dd0 is 0 ? or that the file ended at 0078dd0 ?

03:42 <xiangfu> also we maybe needs erase all NOR flash before flash .

03:43 <xiangfu> wpwrak, the origin file is only 495060 length. so it end at 0078dd0

03:43 <xiangfu> wpwrak, when I read back the standy , I read whole 640KB from m1. so it end at 00a0000

03:43 <wpwrak> xiangfu: ah, so the stuff at the end is a retrieval artefact

03:44 <wolfspraul> xiangfu: yes erase all sounds good. we don't do that now?

03:45 <xiangfu> no

03:45 <wolfspraul> how fast/slow would it be?

03:45 <xiangfu> erase very fast.

03:45 <xiangfu> acceptable

03:45 <wpwrak> you probably erase each block before writing it. that may or may not be sufficient. depends a bit on what the software expects.

03:46 <wolfspraul> in a perfect world we should not need the erase, I guess

03:46 <wpwrak> i think you do

03:46 <wpwrak> erase: 0/1 -> 1. write: 1 -> 0

03:46 <xiangfu> when flash we do need erase.

03:46 <wpwrak> well, write: 0/1 -> 0

03:47 <xiangfu> see the http://dpaste.com/592480/ line 16.

03:47 <xiangfu> it maybe because the last standby.bin is small then the previous one

03:48 <xiangfu> erase all nor flash can make all those bit to '1' :)

03:49 <wpwrak> xiangfu: those two extra words (0004 0004) are indeed a little odd. they're within the same block. so they must have been erased. (if you never erased, you would have noticed by now :)

03:49 <wpwrak> xiangfu: so something is writing a bit of extra data that's not in the file

03:50 <xiangfu> wpwrak, (forget the block size). yes. something is writing a bit of extra data.

03:50 <wpwrak> one more for the bug pile ;-)

03:50 <xiangfu> then that is a bug in urjtag

03:50 <xiangfu> yes.

03:51 <wpwrak> yeah, probably urjtag

03:51 <xiangfu> I can read more partition and compare .

03:51 <xiangfu> see if this happen in other partitions.

03:51 <wpwrak> maybe someÂ Â for (i = 0; i <= n; i++) program_word(i);Â Â :)

03:54 <kristianpaul> fake a file witha now patter

03:54 <kristianpaul> write it read back

03:54 <kristianpaul> comapre :)

03:54 <wpwrak> then change the size and repeat. dd if=/dev/urandomÂ Â is your friend :)

03:59 <wolfspraul> bugs everywhere. sigh. but I need to decide whether we can start selling 'good' rc3 boards or not ;-0

04:01 <wolfspraul> at least we have _lots_ of good starting points

04:06 <wolfspraul> do we have consensus that we should add a full 32 megabytes erase to reflash_m1.sh ?

04:07 <kristianpaul> at least will help to track corupt yes ;)

04:09 <wolfspraul> xiangfu: is Adam using reflash_m1.sh or reflash_all.batch ?

04:11 <wolfspraul> is this up-to-date? http://en.qi-hardware.com/wiki/Milkymist_One_run_3_schedule#Flash_Test_Tool_Image

04:13 <aw_> wolfspraul xiangfu , i used reflash_m1.sh not reflash_all.batch

04:13 <kristianpaul> even is memory mapped i cant get to write as easy as it could sound..

04:13 <kristianpaul> but yes i can read

04:13 <kristianpaul> (NOR)

04:14 <wolfspraul> ok we should update the wiki instructions, also the wiki points to a commit etc - quite confusing

04:14 <wolfspraul> I mean for the test software image

04:15 <wolfspraul> aw_: don't worry, you focus on the boards Xiangfu is working on the tools :-)

04:16 <wolfspraul> 42 'available' now, not bad

04:17 <aw_> wolfspraul, good. i just used his xiangfu's last email in private last time though. not sure if already existed on public server.

04:17 <kristianpaul> sell sell ! _)

04:17 <wolfspraul> growing :-)

04:17 <aw_> xiangfu, http://pastebin.com/cAcWHGBN

04:17 <wolfspraul> well, xiangfu needs to make sure it's all public and update the wiki too. then it's easier for others to follow the exact same process.

04:18 <aw_> alright then ..i go lunch firstly then back to rework on usb.

04:22 <kristianpaul> (update) yeah, lots of usefull scripts now

04:26 <wolfspraul> xiangfu: alright, so Adam uses reflash_m1.sh

04:27 <wolfspraul> should we add a full 32 megabytes erase to reflash_m1.sh ?

04:28 <kristianpaul> lol my LG remote now generates ramdon data on flickernoise :)

04:29 <wolfspraul> yes I've seen that as well

04:29 <xiangfu> at least we can make sure all non-used area is '1'

04:29 <xiangfu> kristianpaul, it is a test screen.

04:29 <xiangfu> kristianpaul, wait input from serial console. 'b' is for 'boot'

04:30 <wolfspraul> I'm just asking what the consensus now is?

04:30 <kristianpaul> xiangfu: boot for?

04:30 <wolfspraul> we already said we want to lock 5 partitions (standby+rescue)

04:30 <wolfspraul> that's clear

04:30 <xiangfu> boot to flickernoise. normal boot

04:30 <kristianpaul> ah, sureÂ Â running it now :)

04:30 <kristianpaul> i could not resist

04:30 <kristianpaul> it work out of the box !

04:31 <xiangfu> kristianpaul, there is a help message output from serial console. that can test screen.

04:31 <kristianpaul> bit slow bott.. but well :)

04:31 <wolfspraul> how about adding nor erase, don't know

04:31 <wolfspraul> maybe a good idea to establish a baseline

04:31 <wolfspraul> also because we have so many uncertainties in the tools, jtag, cable, signal integrity, etc. etc.

04:31 <wolfspraul> seems like uncertainties everywhere

04:32 <wolfspraul> so if we add some more baselines, even if they are theoretically unnecessary, it could help pull some of the other uncertainties apart

04:32 <kristianpaul> okay i'll sleep now but let mm1 rendering whole nigh !

04:32 <wolfspraul> that's why I was thinking maybe reflash_m1.sh, at least when it reflashs all partitions, should start with a full 32 megabytes erase

04:33 <kristianpaul> will be nice as temp will drop to 19Â°C (28Â°C now)

04:33 <kristianpaul> s/will/may

04:33 <kristianpaul> gn8

04:33 <wolfspraul> n8

04:37 <kristianpaul> btw a knowlesdge base for know workarounds is not bad ;)

04:38 <kristianpaul> to have, i think

04:38 <wolfspraul> well

04:38 <wolfspraul> not so easy

04:38 <wolfspraul> too few users, too many bugs

04:39 <wolfspraul> the beginning is difficult, and that's what we go through now

04:39 <wolfspraul> with good will and a little patience, we'll make it

04:40 <kristianpaul> (users) true, no worth effort yet

04:40 <xiangfu> wiki page : http://en.qi-hardware.com/wiki/Milkymist_One_run_3_schedule#Flash_Flickernoise_1.0RC1_.2F_SoC_1.0_updates_and_Run_3_images

04:40 <xiangfu> update a little.

04:41 <xiangfu> should be more clear then before. delete out-data script.Â Â we only use one in RC3.

04:41 <xiangfu> http://milkymist.org/updates/2011-07-13/for-rc3/reflash_m1.sh

05:13 <wpwrak> wolfspraul: (full erase) doesn't seem necessary

05:27 <ignatius-> The JLime kernel tree compiles and sees the entire NAND. I wasn't able to get previous kernels to see that extra NAND. I've deducted that it my be a kernel option. Anyone know what that might be?

08:26 <xiangfu> aw_, Hi

08:26 <xiangfu> http://milkymist.org/updates/current/for-rc3/reflash_m1.sh. I update the reflash_m1.sh, erase the whole nor flash before write anything.

08:26 <aw_> xiangfu, hi yes

08:27 <xiangfu> aw_, you can use new one(__VERSION__="2011-08-12") from now on.

08:27 <aw_> xiangfu, okay.great

08:29 <aw_> xiangfu, just directly use this 08-12 script only, right? no else image file attached?

08:30 <xiangfu> aw_, no needs touch any other files. just this 08-12 script.

08:32 <aw_> xiangfu, okay..I'll mark note on those rest boards's record, so i know which board is done by erase. thanks.

08:32 <xiangfu> aw_, your reflash log is enough. yes. you can mark note on boards

08:34 <xiangfu> aw_, theÂ Â reflash_m1.sh output will be different

08:36 <aw_> xiangfu, okay..I'll watch it..now still reworking. :)

08:39 <xiangfu> aw_, sorry. I upload a new one. if you already download, download again. just output the VERSION before it start flash.

08:40 <aw_> xiangfu, alright. ;-)

10:41 <`antonio`> bartbes, hi, did you manage to compileÂ Â guile 2.0

10:41 <bartbes> `antonio`: still working on it, I just found another dependency

10:42 <bartbes> which was delayed because that project's hosting went down for a bit

10:42 <bartbes> it's back up again, so that's building now

10:44 <`antonio`> how is it going then ?

10:44 <`antonio`> i a trying to build guile 2.0 as well

10:44 <`antonio`> but i'm having some problems

10:46 <bartbes> more dependencies, woo

10:47 <jivs> bartbes, did u compile libunistring successfully?

10:47 <bartbes> yes

10:47 <jivs> any patch required?

10:48 <bartbes> yes

10:48 <bartbes> it's in the repo already

10:48 <jivs> is it uclibc related?

10:48 <bartbes> http://projects.qi-hardware.com/index.php/p/openwrt-packages/source/tree/master/libunistring

10:48 <bartbes> yeah

10:49 <jivs> ok cool, we had the same issue, patched it. but later we had some other errors related to locale

10:50 <bartbes> I can only hope I didn't break it

10:50 <bartbes> time will tell

10:50 <`antonio`> guile 2.0 have some problems in the future with locale

10:51 <`antonio`> bartbes, did you create any patch for guile 2.0 ?

10:51 <bartbes> jivs: if it is, indeed broken, I will see if I can work around, i.e. add some UCLIBC code

10:51 <bartbes> I had to exchange a faulty configure.ac line, so far

10:51 <jivs> we patched it this way, may be it was not right.: #Â Â if __GLIBC__ >= 2

10:51 <jivs> +#Â Â if __GLIBC__ >= 2Â Â && !defined __UCLIBC__

10:52 <jivs> ok

10:52 <bartbes> do note that I have yet to get it to actually get to compiling

10:53 <bartbes> finally, configure finished

10:53 <bartbes> now, we'll see what happens

10:53 <bartbes> .. yeah

10:53 <jivs> cool

10:53 <`antonio`> bartbes, yeh we passed configure

10:53 <bartbes> that failed

10:53 <`antonio`> pastebin it

10:55 <bartbes> duplocal makes pointer from integer

10:55 <`antonio`> cool

10:55 <`antonio`> that's fine i solved that

10:55 <bartbes> do tell

10:56 <`antonio`> I hope that's the proper way of doing it:

10:56 <`antonio`> basically in the guile make file add the CONFIGURE_ARGS

10:57 <`antonio`> CONFIGURE_ARGS += -C

10:57 <`antonio`> and do make package/guile/{clean,compile} V=99

10:57 <bartbes> "Â Â -C, --config-cacheÂ Â Â Â Â Â alias for `--cache-file=config.cache'"?

10:58 <`antonio`> yeh

10:58 <bartbes> hmm right

10:58 <`antonio`> in the config.cache

10:58 <`antonio`> basically is failing because we need to set some proper variables

10:58 <`antonio`> i'll give you the line in a sec

10:59 <bartbes> it's already recompiling

10:59 <`antonio`> do clean,prepare then

10:59 <`antonio`> don't let it pass the configuring otherview it will not go through the config.cache

11:00 <`antonio`> in the config.cache search for duplocaleÂ Â and there is something like this

11:00 <`antonio`> gl_cv_func_duplocale_works=${gl_cv_func_duplocale_works='guessing no'}

11:00 <`antonio`> change no to yes

11:00 <`antonio`> and it will pass that error

11:00 <bartbes> then.. this sounds like a very hacky solution

11:00 <bartbes> the thing is, I know it doesn't work

11:00 <`antonio`> this is a temporary solution

11:01 <bartbes> because I believe it is the function I nerfed

11:05 <jivs> bartbes, do u think this error might be related with libunistring in some way?

11:05 <bartbes> I would expect that, yes

11:07 <bartbes> hmm

11:08 <bartbes> it is a different function

11:18 <`antonio`> which one?

11:18 <jow_laptop> `antonio`: the proper way to override that (within an OpenWrt makefile) isÂ Â CONFIGURE_VARS += gl_cv_func_duplocale_works=yes

11:18 <jow_laptop> no need to edit a config.cache

11:19 <`antonio`> jow_laptop: nice, thanks

11:19 <jow_laptop> the configure should then output something likeÂ Â "Checking for foo ... yes (cached)"

11:19 <jivs> jow_laptop, thanks

11:22 <bartbes> is there a MAKE_ARGS thing too?

11:22 <bartbes> jow_laptop: ah, cool

11:23 <jow_laptop> bartbes: yes

11:24 <bartbes> I was going to override it at a later point, during make, but this probably works

11:24 <jow_laptop> there isÂ Â MAKE_VARSÂ Â which overrides the environment (e.g.Â Â FOO=bar make ...)

11:25 <jow_laptop> andÂ Â MAKE_FLAGSÂ Â which extends the args (e.g.Â Â make FOO=bar)

11:25 <bartbes> right, so if it doesn't work I can play with MAKE_FLAGS, thanks

11:26 <jow_laptop> just be sure to always append (+=) to those vars since they already contain a bunch of default overrides and variables

14:56 <bartbes> `antonio`: I managed to get it down to a link error

14:56 <bartbes> updating the old patch should fix that

14:57 <`antonio`> can you paste bin it

14:57 <jivs> cool bartbes

14:57 <bartbes> also time to take out all my desperate attempts

14:58 <bartbes> it's just the csqrt one

14:59 <bartbes> also, do you know what this 'issue with threads' is?

14:59 <jivs> csqrt, is it similar to the guile1.8.7 patch

14:59 <bartbes> yeah

14:59 <bartbes> so easy

15:00 <jivs> there is a patch for that already on 1.8.7, hopefully it will work

15:01 <bartbes> no need to

15:01 <bartbes> I know what it does

15:01 <bartbes> so I can just replicate it

15:01 <jivs> okay

15:01 <bartbes> I guess I might as well start working on become the richest man in the world

15:01 <bartbes> because that will finish sooner than this compile

15:02 <jivs> so can be solved using configure_vars from Makefile. isn't it?

15:02 <`antonio`> bartbes, how long it takes in your machine ?

15:02 <bartbes> jivs: that's what I tried

15:03 <jivs> cool

15:04 <bartbes> `antonio`: not as long as I made it out to be, but 10 mins, I guess

15:04 <bartbes> if this build fails I'll time it for you

15:04 <bartbes> (hoping it doesn't, though)

15:05 <jivs> lets be optimistic :-)

15:06 <bartbes> well, you know, I'm undoing my desperate measures

15:06 <bartbes> so it might very well happen

15:21 <bartbes> `antonio`: well, 10 minutes seems like a good estimate for the source to compile

15:21 <bartbes> it's been chewing through docs for a while now, though

15:21 <bartbes> stupid texinfo manuals..

15:22 <`antonio`> so successfully completed ?

15:25 <`antonio`> bartbes, then you'r almost there

15:32 <bartbes> it's still creating manuals

15:32 <bartbes> I'm going to have to see if there's an option to turn that off

15:58 <bartbes> `antonio`: progress update: still compiling texinfo manuals

15:58 <bartbes> not a fun activity

15:58 <`antonio`> what processor do yo have?

15:59 <bartbes> it's compiling on an old p4

15:59 <bartbes> but still, it's coming up to 45 mins

15:59 <jivs> still no new error, so good going

16:00 <jivs> if it completes fine, will be worth the wait ...

16:00 <bartbes> jivs: like I said, it's been compiling docs (with the same command) for 45 mins!

16:00 <bartbes> yeah, I'll just let xiangfu dispatch the build server or something

16:00 <bartbes> like hell I'm compiling these docs again..

16:04 <`antonio`> wpwrak, I am following the INSTALL-Ben instructions and applying patches to the kernel but whenÂ Â I install the kernel in my nanonote I getÂ Â "ERROR: Can't get kernel image!". the problem might be that I am using my own image, can I apply those patches directly to the toolchain?Â Â

16:15 <`antonio`> bartbes, got to go now, let me know if you got it working !

17:48 <jivs> bartbes, How did it go?

17:49 <bartbes> still.. making.. docs..

17:49 <jivs> omg

17:50 <jivs> have u found any way to disable that for future!

17:51 <bartbes> not yet

17:51 <bartbes> :(

17:53 <jivs> I will also start in my toolchain soon. but its not that powerful though...

17:54 <jivs> bbiab

17:56 <bartbes> guild snarf-check-and-output-texiÂ Â Â Â Â Â Â Â Â Â > guile-procedures.texi || { rm guile-procedures.texi; false; }

17:56 <bartbes> 30425 pts/1Â Â Â Â R+Â Â 152:41 /bin/sh /media/shared/home/nanonote/openwrt-xburst/build_dir/target-mipsel_uClibc-0.9.32/guile-2.0.2/meta/guile -e (@@ (guild) main) -s /media/shared/home/nanonote/openwrt-xburst/build_dir/target-mipsel_uClibc-0.9.32/guile-2.0.2/meta/guild snarf-check-and-output-texi

17:56 <bartbes> oh, minus that first line

18:02 <bartbes> I couldn't help but interrupt

18:02 <bartbes> this wasn't going to work

18:07 <jivs> oh

18:08 <jivs> i think there is some patch to disable snarf on guile 1.87, will that help us now?

18:18 <viric> I've always wondered how someone building linux manage the memory it is going to use

18:18 <viric> (user programs apart, of course)

18:18 <viric> Looking for that, I never found the information I wanted. Does anybody here happen to knuw much about that?

18:18 <viric> know

18:20 <viric> it's clear how to compile away code

18:20 <bartbes> jivs: probably

18:20 <viric> but not-code... how?

18:20 <bartbes> jivs: I hope so, because my attempt failed

18:21 <jivs> i will update you my progress..

18:40 <bartbes> jivs: please tell me you've found a way to disable the docs yet

18:40 <jivs> bartbes, can u paste the second confiigure_args

18:41 <jivs> configure_vars

18:41 <bartbes> CONFIGURE_VARS += gl_cv_func_duplocale_works=yes guile_cv_use_csqrt="no, Ben NanoNote (cross-compiling)"

18:42 <jivs> sorry I don't have that good news yet..

18:42 <bartbes> the worst part is that it takes about 10 mins to verify..

18:42 <jivs> did u get this error? :->Â Â i18n.c: In function 'str_upcase_l':

18:42 <jivs> i18n.c:874:12: error: dereferencing pointer to incomplete type

18:45 <jivs> this error went away after I added 2nd conf_var

18:45 <jivs> Did u get this error? :-> bash: -c: line 0: unexpected EOF while looking for matching `"'

18:45 <jivs> bash: -c: line 1: syntax error: unexpected end of file

18:46 <jivs> paste here plz, bbiab

18:48 <bartbes> jivs: I never got the second, probably because I patched the first

19:14 <bartbes> jivs: I almost cracked it

19:16 <jivs> whats the error now?

19:17 <bartbes> I disabled the build rule and it complained about the lack of output

19:17 <bartbes> so I disabled that expectation too

19:21 <jivs> is it compiling now?

19:21 <bartbes> yeah

19:22 <bartbes> we'll see how this ends..

19:22 <bartbes> if it builds, I'll commit

19:22 <bartbes> testing can wait

19:22 <bartbes> I've spent enough time on this..

19:23 <jivs> ok

19:24 <jivs> Can you pastebin your Makefile plz. I am still getting that bash: -c error

19:24 <jivs> may be sthg wrong with my Makefile..

19:25 <bartbes> http://codepad.org/X5NpqBmf

19:26 <bartbes> I have 3 patches, though

19:29 <jivs> I am using 2.0.0

19:29 <jivs> let me try with 2.0.2, if anything changes

19:29 <bartbes> well

19:29 <bartbes> new result

19:29 <jivs> whats it?

19:29 <bartbes> it's probably not the docs, it's just the guile scripts seem to hang

19:30 <bartbes> the one that are supposed to execute on the host

19:30 <jivs> Do you have the same version of guile on the host?

19:30 <bartbes> no

19:30 <jivs> try updating ti 2.0.2, may be will do some good

19:31 <bartbes> yeah.. but isn't the target debian?

19:31 <bartbes> (which contains 1.8.7 afaik)

19:32 <bartbes> anyway, I'll zip this up

19:32 <jivs> i think we have to install from source..

19:32 <bartbes> so you can play with it

19:32 <bartbes> or anyone else who wants to

19:32 <jivs> btw what patches do u have for guile

19:32 <jivs> i needed ltdl, and unistring patch

19:33 <bartbes> I also have one for disabling the docs

19:33 <bartbes> unistring patch?

19:33 <bartbes> that's not for guile itself, is it?

19:33 <bartbes> I have one for i18n

19:34 <bartbes> http://dl.dropbox.com/u/440010/guile.tar.gz

19:34 <jivs> i needed toÂ Â change the configure.ac where it checks for libunistring... it was complaining in my case