28 Jul 2011 16:34
SQLite join takes a very long time
All- I have a set of text files, each with 410 million records (with a structure something like these 5-record examples) that I'd like to bring into SQLite: file a = [-99, 1, 1, -99, -99] file b = [-99, 2, 4, 1, -99] file c = [-99, 7, 7, -99, -99] The -99 values are NA; location in the vector matters. I'd like as my final output a table looking like this ID a b c 1 1 2 7 2 1 4 7 With ID being the location in the vector (= row) and only rows without NA in the table. With a test dataset of 6 files of 10 million records each, I've tried importing each file independently to separate tables of two columns each that look like this: table_a ID a 0 -99 1 1 2 1 3 -99 4 -99(Continue reading)
RSS Feed