Incorrect splitting into sentences

Hi all! I am talking about [this](https://github.com/cameronsutter/odyssey/blob/fd13e9e5bede733ceff4b6a95f2ac516c19cb5f9/lib/odyssey/engine.rb#L18) regular expression, which is later used to count sentences:
```
SENTENCE_REGEX = /[^\.!?\s][^\.!?]*(?:[\.!?](?!['"]?\s|$)[^\.!?]*)*[\.!?]?['"]?(?=\s|$)/
```

For text like "Mr. Smith is a doctor", this yields two sentences, `["Mr.", "Smith is a doctor"]`, which results in incorrect readability scores.
Maybe there is a way to improve it so that common titles (such as "Mr." or "Dr.") are not treated as sentence endings?
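
Here is a minimal sketch to reproduce it. The pattern is copied from above into a local variable (`sentence_regex` is just a name I picked for this snippet), and the sample text is the one from my example:

```ruby
# Same pattern as SENTENCE_REGEX in lib/odyssey/engine.rb, copied from above.
sentence_regex = /[^\.!?\s][^\.!?]*(?:[\.!?](?!['"]?\s|$)[^\.!?]*)*[\.!?]?['"]?(?=\s|$)/

text = 'Mr. Smith is a doctor'
sentences = text.scan(sentence_regex)

p sentences          # "Mr." comes back as a separate sentence
puts sentences.size  # so the count is 2 instead of 1
```

The period after "Mr" is followed by whitespace, so the pattern treats it as the end of a sentence.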

I am not very good with the `scan` method, but if we use `split` instead, we could probably use an expression similar to this:
```
(?<!\w\.\w.)(?<![A-Z][a-z]\.)(?<=\.|\?|!|(\."))\s
```
which is also far from perfect, because it handles "Mr." and "Dr." but not "Mrs." (still better than nothing ☺).
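
To make the idea concrete, here is a rough sketch of the `split` approach in Ruby. It is only an illustration, not a proposed patch: the `titles` list is my own assumption, and I changed `(\.")` to a non-capturing alternative because `String#split` adds captured groups to the returned array. Each title gets its own negative lookbehind so that every lookbehind stays fixed-width.

```ruby
text = 'Mr. Smith is a doctor. Mrs. Smith is a lawyer. Are they happy? Yes!'

# The expression from above, with (\.") made non-capturing so that
# String#split does not add the captured text to its result.
two_letter_guard = /(?<!\w\.\w.)(?<![A-Z][a-z]\.)(?<=\.|\?|!|\.")\s/
naive = text.split(two_letter_guard)
p naive      # "Mrs." still ends up as its own element

# Hypothetical title list (an assumption, extend as needed).
# One fixed-width negative lookbehind is generated per title.
titles = %w[Mr Mrs Ms Dr Prof]
title_guards = titles.map { |t| "(?<!#{Regexp.escape(t)}\\.)" }.join
boundary = Regexp.new('(?<!\w\.\w.)' + title_guards + '(?<=[.?!]|\.")\s')

sentences = text.split(boundary)
p sentences  # four sentences, with "Mr." and "Mrs." kept attached
```

The obvious downside is that a sentence which really ends with one of the listed titles will not be split, so this trades one kind of error for a (hopefully rarer) other one.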

Incorrect splitting into sentences #39
