Code to find SIMILARITIES between two text files

Completed Posted Dec 7, 2010 Paid on delivery
Completed Paid on delivery

This is NOT as easy as it sounds. Don't be fooled - if you dont know what you're doing, don't apply.

I want a small application written that can examine two (LARGE) text files and find identical lines of matching text.

This is NOT a 'text difference' application, this is a text SIMILARITY tool.

For instance, if I have a 1GB text file (File A) and I want to compare it against a 500Meg text file (File B) - I need this application to list the parts of each file that are identical.

1) You will need to be able to select the line length, i.e. 20 chars, 15 chars etc, and it must then try to find ANY 20 characters in File A that are also present in File B.

2) options for case sensitivity, whitespace removal, carriage returns etc - I dont want an identical line to be missed simply because it was split by a carriage return or had an extra formatting tab or space in it.

1) I dont care what language you write this in

2) The application MUST BE FAST!

3) If you can use multi core multi threads even better.

results must be legible.

4) It must work properly. No repeated results listing the same string 1000 times because your algorithm doesn't work. Similarly no missing out results either please

Sounds easy huh :-)

Best of luck.

Engineering Microsoft Project Management Software Architecture Software Testing Windows Desktop

Project ID: #2972485

About the project

15 proposals Remote project Active Dec 7, 2010

Awarded to:

AlexNaumov

See private message.

$84.15 USD in 5 days
(89 Reviews)
6.2

15 freelancers are bidding on average $64 for this job

wassily

See private message.

$85 USD in 5 days
(38 Reviews)
6.5
quickprogexpert

See private message.

$68 USD in 5 days
(138 Reviews)
6.3
muhammadilyas14

See private message.

$76.5 USD in 5 days
(28 Reviews)
3.9
readyfacts

See private message.

$51 USD in 5 days
(22 Reviews)
3.9
hammansamuel

See private message.

$85 USD in 5 days
(10 Reviews)
3.5
abbasi99

See private message.

$85 USD in 5 days
(10 Reviews)
3.8
TaimoorTakkar

See private message.

$68 USD in 5 days
(2 Reviews)
0.9
saadmohamedvw

See private message.

$42.5 USD in 5 days
(2 Reviews)
0.5
sumiranlingwal

See private message.

$42.5 USD in 5 days
(0 Reviews)
0.0
tecknik

See private message.

$85 USD in 5 days
(0 Reviews)
0.0
Athanas

See private message.

$12.75 USD in 5 days
(2 Reviews)
0.2
smzmali

See private message.

$51 USD in 5 days
(0 Reviews)
0.0
erpoojasharma

See private message.

$85 USD in 5 days
(1 Review)
0.0
infyamrita

See private message.

$42.5 USD in 5 days
(0 Reviews)
0.0