#rom-rb on 2013-12-29 — irc logs at freenode.irclog.whitequark.org

2013-06-04 15:26 solnic changed the topic of #rom-rb to: Ruby Object Mapper | Mailing List: https://groups.google.com/forum/?fromgroups#!forum/rom-rb | Logs: http://irclog.whitequark.org/rom-rb

00:18 coop-cooper has joined #rom-rb

00:33 coop-cooper has quit [Ping timeout: 245 seconds]

01:14 snusnu has quit [Quit: Leaving.]

02:35 lgierth has quit [Quit: Ex-Chat]

02:45 coop-cooper has joined #rom-rb

02:59 <coop-cooper> Hey guys, I've been watching the ROM project for awhile now and I'd like to contribute. I have seen the Roadmap on the website but I was wondering if there was a more detailed "todo" list that I could attempt to pick at?

03:36 coop-cooper has quit [Ping timeout: 252 seconds]

03:55 coop-cooper has joined #rom-rb

04:26 coop-cooper has quit [Ping timeout: 245 seconds]

04:29 snusnu1 has joined #rom-rb

04:32 snusnu1 has quit [Read error: Connection reset by peer]

06:28 coop-cooper has joined #rom-rb

06:43 coop-cooper has quit [Ping timeout: 252 seconds]

06:49 coop-cooper has joined #rom-rb

07:20 coop-cooper has quit [Ping timeout: 245 seconds]

09:40 mbj has joined #rom-rb

10:26 coop-cooper has joined #rom-rb

10:28 mbj has quit [Ping timeout: 272 seconds]

10:33 coop-cooper has quit [Ping timeout: 240 seconds]

10:33 coop-cooper has joined #rom-rb

10:55 coop-cooper has quit [Ping timeout: 272 seconds]

11:46 coop-cooper has joined #rom-rb

11:49 mbj has joined #rom-rb

12:08 coop-cooper has quit [Ping timeout: 272 seconds]

13:16 postmodern has quit [Quit: Leaving]

14:13 snusnu1 has joined #rom-rb

14:16 mbj has quit [Ping timeout: 246 seconds]

15:12 mbj has joined #rom-rb

16:45 jfredett-w has joined #rom-rb

16:45 jfredett-w1 has quit [Ping timeout: 240 seconds]

16:57 lgierth has joined #rom-rb

18:04 snusnu1 has quit [Ping timeout: 246 seconds]

18:05 snusnu1 has joined #rom-rb

18:10 Gibheer has quit [Ping timeout: 245 seconds]

18:11 lgierth has quit [Quit: Ex-Chat]

18:17 Gibheer has joined #rom-rb

18:43 snusnu1 has quit [Quit: Leaving.]

19:38 <dkubb> mbj: I'm nearing 1:1 feature parity with axiom-sql-generator. I was thinking about replacing the string concat with sql.rb and see if I can get all the specs to pass

19:39 <dkubb> mbj: then I'll probably change it to try to create inner joins when possible instead of nested subqueries

19:39 <dkubb> there's also a few other sql specific optimizations I can do

19:40 <dkubb> like transforming: col = 1 OR col = 2 OR col = 3 into col IN (1, 2, 3)

19:44 <mbj> Yeah

19:44 <mbj> dkubb: AST based transforms are many times more easy than optimizing the string concatenator.

19:44 <mbj> Also the SQL gem will be very usefull for other projects!

19:44 <dkubb> yeah

19:44 <dkubb> I'd love to work in the parser soon

19:45 <dkubb> I think it will provide some extra tension that'll help solidify the ast structure

19:45 <dkubb> right now it's basically optimized for generation

19:45 <dkubb> but I want to make sure it works well for parsing

19:45 <mbj> Yeah

19:45 <dkubb> plus when I work on the fuzzer, that'll add constraints from another dimension

19:45 <mbj> Optimizing down the number of nodes is a good strategy to ease parsing.

19:46 <mbj> I think you'll remove the "named for positional signalling" again.

19:46 <mbj> But TRY IT OUT FOR YOURSELF!

19:46 <mbj> :D

19:46 <dkubb> :)

19:46 <dkubb> yeah

19:46 <dkubb> I think positional is fine when the position is clear

19:46 <dkubb> anything "functional" in nature would be fine

19:46 <dkubb> like s(:and, node1, node2)

19:47 <dkubb> but a SELECT query doesn't really fit that mold too well

19:47 <dkubb> it's like a huge function with too many optional parameters

19:47 <mbj> I think this is a misdesign of SQL, having such a powerful SELECT node :D

19:48 <mbj> BTW both unparser and mutant have AST-processing backreferences.

19:48 <mbj> I try to reduce their use.

19:48 <dkubb> I'll check that out

19:49 <mbj> IMHO a good designed AST / Language will try to reduce the need for backreferences.

19:49 <dkubb> I'm still trying to decide if the parent should know about every possible child, or if the child should know what context to use depending on the parent

19:49 <mbj> I tried both approaches.

19:49 <dkubb> the problem with SQL is that some nodes can appear nearly anywhere

19:49 <mbj> Mutant / Unparser both are hybrid in this sense.

19:49 <dkubb> so every parent would have to know about every possible child

19:49 <mbj> First go for correctness.

19:50 <dkubb> I think it's going to be messy no matter what because SQL is messy

19:50 <mbj> Emitting superflownous parentheses is perfectly fine.

19:50 <dkubb> yeah, that's important to me

19:50 <dkubb> I want the emitted SQL to be as close to what I would write by hand

19:50 <mbj> In unparser foo << bar * baz gets emitted as foo << (bar * baz)

19:50 <dkubb> just like unparser

19:50 <mbj> dkubb: Do iterations!

19:50 <dkubb> well, yeah, correctness will be #1

19:50 <mbj> Dont try to write the first sql unparser with the #2 target in mind.

19:51 <mbj> Get it correct than make it pretty.

19:51 <mbj> I tried the same with unparser, and got far. With this strategy.

19:51 <dkubb> yeah, I will

19:51 <dkubb> I'll make sure the tests support me

19:51 <dkubb> fuzzing will be a big part of this

19:51 <dkubb> I need to get the parser working though

19:51 <dkubb> so I can do round-trip tests

19:53 <mbj> dkubb: I still need to do fuzzing for unparser.

19:54 <dkubb> fuzzing will probably uncover lots of bugs in parser

19:54 <dkubb> when I fuzzed axiom-sql-generator I found more bugs in the parsing of the SQL than I did in the generation

19:54 <mbj> I know whitequark regulary tests parser against the rubygems corpus.

19:54 <dkubb> yeah, that's a good idea

19:54 <mbj> BUT he cannot verify correctness with this technique.

19:55 <mbj> Only parser + unparser could be used for verification.

19:55 <mbj> If you can execute the tests after unparsing.

19:55 <mbj> Only problem are __LINE__ references.

19:55 <mbj> But these could be emitted hardcoded.

19:55 <dkubb> I wonder if you could use a markov chain to do the fuzzing

19:56 <mbj> Good idea.

19:56 <mbj> Have to try.

19:56 <mbj> Additional idea: I wanna have a set of predicates per node that define if a node is valid.

19:56 <mbj> For example s(:send, nil, nil, nil) is NOT valid.

19:56 <mbj> s(:send, :foo, :baz) is valid ":foo.baz"

19:56 <mbj> s(:send, :foo, :baz) is valid "foo.baz" * sorry

19:56 <mbj> nah

19:56 <mbj> both are invalid :D

19:57 <dkubb> I still wonder if a lightweight "Send" object that has a #to_ast method that returns the ast might be better, then the validity checking can be encpasulated in the object

19:57 <mbj> s(:send, nil, :baz) this one is valid.

19:57 <mbj> I'm talking about our generic fuzzer.

19:57 <mbj> The fuzzer must have some predicates that allow him to make sure no invalid AST gets generated.

19:57 <mbj> Sure we can always add dedicated fuzzer objects per type. (mutant mutation emitter like).

19:58 <mbj> But make we can write a generic fuzzer if you have predicates on the nodes.

19:58 <dkubb> a markov chain would define the range allowed values for each node

19:58 <dkubb> *range of

19:58 <mbj> Yeah, a markov chain is the inverse of my predicates on nodes idea.

19:58 <mbj> Or maybe the same :D

19:59 <dkubb> so you start at the root, then randomly pick a node, then randomly pick some valid values for that node

19:59 <dkubb> you could have some limits on the depth of the tree

20:00 <mbj> dkubb: https://github.com/whitequark/parser/issues/121#issuecomment-31211771

20:00 <dkubb> for any given node, other nodes could be valid arguments, so it could go on infinitely without some limit

20:00 <mbj> I'll do a PR soon. Will help me to simplify unparser / mutant.

20:00 <dkubb> ahh, that's cool

20:00 <mbj> to provide a better selfcare.

20:00 <mbj> s/selfcare/selftest/

20:00 <dkubb> sql.rb should provide a list of valid node types

20:00 <mbj> yeah

20:01 <dkubb> simply by looking to see what names are registered

20:01 <mbj> exactly

20:01 <dkubb> btw, I would usually not have a constant be part of the public interface.. instead of Unparser::Emitter::REGISTRY.key?(type) you could have something like: Unparser::Emitter.handler(type)

20:02 <mbj> yeah

20:02 <dkubb> I've found I much prefer having a method that returns the handler, as opposed to having a key test

20:02 <mbj> Sure.

20:02 <dkubb> it allows me, when necessary, to create objects on-the-fly for handling

20:02 <mbj> It was just easier to explain this way.

20:02 <dkubb> instead of always relying on static handlers

20:02 <dkubb> I did this in axiom-types and it simplified some stuff

20:03 <mbj> For example I'd love to see Parser::CurrentRuby.node_types => Set.new(:lvar, :ivar, .... )

20:03 <dkubb> that would be really nice

20:03 <mbj> Parser::CurrentRuby.parse is the public interface.

20:03 <dkubb> maybe your PR should include that

20:04 <mbj> Dunno, making up a list of all ruby version specific nodes is to much for my OSS time frame.

20:04 <dkubb> heh

20:04 <mbj> 1.8 has :not

20:04 <mbj> 1.9 not has :not

20:04 <mbj> etc.

20:04 <dkubb> just get ahold of whitequarks framework for testing rubygems

20:04 <dkubb> then have it run through the gems and write out the nodes, per ruby type, to a file

20:04 <mbj> good idea.

20:04 <dkubb> then it's a matter of letting it run

20:05 <mbj> But this will take longer than Parser::Meta::NODE_TYPES

20:05 <mbj> And I need this to relihable close that :until bug in mutant :D

20:05 <mbj> I dislike to close bugs without solving the infrastructure problems behind it.

20:07 <mbj> For mutant I'll just add a blacklist of the 1.8 nodes when interpreting Parser::Meta::NODE_TYPES

21:26 <mbj> dkubb: loops controlled by until and while could be named as: conditional loop? (searching the name for the Mutator::Node subclass for both).

23:26 _br_ has quit [Ping timeout: 272 seconds]

23:28 _br_ has joined #rom-rb

23:57 postmodern has joined #rom-rb