How do I insert punctuation in unpunctuated text?
Show older comments
Hi all,
I am currently working on a very fun project, but unfortunately came across a problem I haven't been able to solve for some time. What I am trying to do is punctuate text that contains no punctuation. At the time of writing I have a lot of text files that contain proper punctuation, and matching text files without punctuation.
Initially, I thought that Matlab would have some Neural Network that I could train with the input and output files I have but no, unfortunately not.
Therefor I am reaching out to you and hope there is someone who can help me punctuate unpunctuated text.
8 Comments
Jan
on 22 Jan 2018
Please give us a small example.
Walter Roberson
on 22 Jan 2018
You might get some ideas from https://www.mathworks.com/help/textanalytics/examples/classify-text-data-using-deep-learning.html
You would not remove the punctuation in your case -- but you could convert them into tokens.
r r
on 22 Jan 2018
r r
on 22 Jan 2018
Guillaume
on 22 Jan 2018
I have no idea how to solve this. It seems a very difficult problem, particularly as there seems to be many valid ways of applying punctuation to the given sample. e.g:
"What's happened to me?" he thought. It wasn't a dream.
What's happened to me? He thought it wasn't a dream.
r r
on 22 Jan 2018
Guillaume
on 22 Jan 2018
No neural network is going to be able to say which is more correct of:
What's happened to me? He thought. It wasn't a dream.
"What's happened to me?", he thought. It wasn't a dream.
What's happened to me? He thought it wasn't a dream.
Without a ton of context no human can do that either. And even with context, it can still be ambiguous.
The traditional example:
Eats shoots and leaves. (Panda)
Eats, shoots, and leaves. (Gunman)
r r
on 22 Jan 2018
Answers (0)
Categories
Find more on Deep Learning Toolbox in Help Center and File Exchange
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!