Find Jobs
Hire Freelancers

We're looking for a mathematician

$15-25 USD / hour

Closed
Posted about 5 years ago

$15-25 USD / hour

Hello, We have texts composed of sections. In these sections, we sometimes have titles but we always have paragraphs. Inside paragraphs, we have one or several sentences. Sometimes, in one section, we have what we name "blocks". A block is a little group made of one title and one or several paragraphs. We are developing a tool and we need a mathematician to help us take the best decision. With this tool, users will import a batch of several hundreds or thousands of texts, gathered in 1 file. All these texts will have the same structure (same number of sections, blocks, titles, paragraphs and sentences, all in the same order. Users will then define: 1/ Select only a part of the elements: ● If we must use all sections for the output texts or if the tool must use between x and y sections upon the total number of sections that we have in the origin texts. ● Same with blocks inside sections. ● Same with paragraphs inside blocks and sections. ● Same with sentences inside paragraphs. ● If we can sometimes hide the title of a block or if we must "print" it in each and every output text. 2/ Swap some elements: ● If we can swap the sections to get them in a different order in each and every output text. ● Same with blocks inside sections. ● Same with paragraphs inside blocks and sections. ● Same with sentences inside paragraphs. If we have 37 456 texts in the input file, we must get 37456 texts in the output file. What we want is to get the most different structures in output between each text. We think this can be achieved by considering that each sentence/paragraph/title/block/section is a distinct element in a sequence. The goal would then be to use the principle of Hamming distance, to get the most different sequences in the output texts. But if you think there's a better way to achieve this goal, we're all ears. 1st question you will have to answer to: is it better to work with smaller sequences (one sequence for one paragraph, then one sequence for one block, then one sequence for one section, then one sequence to select and swap sections) or is it better to work sequences globally, gathering all the elements for 1 text inside 1 longer sequence? 2nd question: are you able to code the algorithm? (it's not mandatory, as long as you can explain the principles to a developer). Best regards, Marco.
Project ID: 18572610

About the project

17 proposals
Remote project
Active 5 yrs ago

Looking to make some money?

Benefits of bidding on Freelancer

Set your budget and timeframe
Get paid for your work
Outline your proposal
It's free to sign up and bid on jobs
17 freelancers are bidding on average $24 USD/hour for this job
User Avatar
Hello, I'm data scientist with huge expertise and mathematician with a number of publications. Also I'm participant and problem writer of many algorithm competitions (Topcoder, ACM ICPC). I can code it by myself, because I have huge expertise in writing different algorithms, including very complicated string search algorithms. Before anwering both questions I need to know your motivation. Do you want to create anti plagiarism tool or you have some other motivation? Feel free to contact me to discuss any details of the project. Looking forward to hearing from you!
$20 USD in 40 days
5.0 (26 reviews)
6.2
6.2
User Avatar
dear sir I am mathematics expert I can help you. I have experience to solve many mathematical problems
$20 USD in 40 days
4.9 (38 reviews)
5.5
5.5
User Avatar
hello friend! i am a math. teacher with 25 years of experience right from the basic level to advanced level dealing with almost all topics except few like game theory. i think i can cater your needs. thank you if you can understand my expertise.
$20 USD in 40 days
5.0 (11 reviews)
4.8
4.8
User Avatar
Hello. Since you have a lot of tests I think developing and AI/Deep Learning model can solve the problem. If you are interested we can talk about more details. Thanks, Helmot
$22 USD in 40 days
4.9 (5 reviews)
4.9
4.9
User Avatar
Dear employer, Hi I have done my M.Sc. thesis using Python and Matlab. It was about developing a numerical model for simulating fluids flow through porous media. I developed the main code in Python and developed my analyzer tools in Matlab. I learned lots of tricks in programming with these two great languages. I also had 2 big contracts. One of them was a contract with an educational institute and was about developing an Excel program for managing their workshops participants. I developed this program with Exel VBA. The other contract was about developing a numerical model for the search and rescue operation in the sea which I developed with C++. I have about 5 years of work experience in computer programming using different languages. It would be a great chance for me if we could collaborate with each other as I am an engineer who loves computer programming and solving the algorithms. My rating is a little low because one of my employers was a dealer and he did not want me to improve my business on this website. But honestly, I have some rules for my working life and their most important are: Be ON TIME, RESPONSIBLE, and RESPECTFUL. You can read the reviews of other employers on my profile. I am always here to answer your questions even after the project completion. It would be highly appreciated if you could send me the file that contains the text so we can discuss more. Regards, Amir
$20 USD in 40 days
4.8 (26 reviews)
4.4
4.4
User Avatar
Hello, I might be interested in this sort of project. I am particularly skilled in algorithm development and in Artificial Intelligence. I have over 10 years R&D experience and as a professional programmer. Please see my web site for examples of my work - the textflo system, for example does text processing. I have experience with the programming languages Java or C#. For your problem, I think that you have to recognise similar and different sections in your different documents and use that to index your documents on those similarities or differences. Statistical counts of keywords is important as well as the text sequence length, so it would depend on a number of factors. I typically use the messaging system in the first instance. If you could send further details, then I can get back to you.
$22 USD in 20 days
4.2 (2 reviews)
1.9
1.9
User Avatar
Hello! You have a very unusual and interesting task! I can help you as it's a data mining task. I'll python+pandas for this task to create an algorithm. Details of the solution depends on amount of data you have. 37 456 text is a big number, but what files sizes? Also, if your text has a super clear structure maybe it makes sense you use Elasticsearch to process it. Let's discuss it and please send an example of the file.
$27 USD in 40 days
0.0 (0 reviews)
0.0
0.0
User Avatar
Joven ingeniero con solidos conocimiento,
$22 USD in 10 days
0.0 (0 reviews)
0.0
0.0
User Avatar
Postgraduate in mathematics. Teacher in mathematics Relevant Skills and Experience Knowledge of computer software.
$33 USD in 10 days
0.0 (0 reviews)
0.0
0.0
User Avatar
hello My name is taposh dhali. I completed BSc Engineering in electronics and Communication. I am an expert in mathematics and algorithm. I want to help you. This is a simple proposal because I want to prove my work.
$16 USD in 40 days
0.0 (0 reviews)
0.0
0.0
User Avatar
I am looking for company/organization to enhance my skills and potential, where I can get and provide a lot, and I wish to apply academics & professional knowledge to challenging tasks leading to growth and development of the company/organization.
$61 USD in 24 days
0.0 (0 reviews)
0.0
0.0

About the client

Flag of FRANCE
PARIS, France
5.0
46
Payment method verified
Member since Jan 14, 2011

Client Verification

Thanks! We’ve emailed you a link to claim your free credit.
Something went wrong while sending your email. Please try again.
Registered Users Total Jobs Posted
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Loading preview
Permission granted for Geolocation.
Your login session has expired and you have been logged out. Please log in again.