Nicolas Patry
d8fc26e919
NerPipeline (TokenClassification) now outputs offsets of words ( #8781 )
...
* NerPipeline (TokenClassification) now outputs offsets of words
- It happens that the offsets are missing, it forces the user to pattern
match the "word" from his input, which is not always feasible.
For instance if a sentence contains the same word twice, then there
is no way to know which is which.
- This PR proposes to fix that by outputting 2 new keys for this
pipelines outputs, "start" and "end", which correspond to the string
offsets of the word. That means that we should always have the
invariant:
```python
input[entity["start"]: entity["end"]] == entity["entity_group"]
# or entity["entity"] if not grouped
```
* Fixing doc style
2020-11-30 14:05:08 -05:00
..
2020-11-09 11:39:55 -05:00
2020-01-06 15:11:12 +01:00
2020-11-03 16:57:12 -05:00
2020-10-30 15:09:10 -04:00
2020-10-30 15:09:10 -04:00
2020-09-23 13:25:24 -04:00
2020-09-23 13:25:24 -04:00
2020-08-27 20:01:17 -04:00
2020-11-16 21:43:42 -05:00
2020-10-22 15:39:01 +02:00
2020-11-04 17:24:49 -05:00
2020-11-02 08:47:37 -05:00
2020-11-10 07:11:02 -05:00
2020-11-16 21:43:42 -05:00
2020-11-03 16:04:22 +01:00
2020-11-18 17:06:25 +01:00
2020-11-20 22:07:21 +01:00
2020-11-11 12:59:40 -05:00
2020-10-29 10:56:25 -04:00
2020-11-16 21:43:42 -05:00
2020-01-06 15:11:12 +01:00
2020-09-11 12:01:33 -04:00
2020-11-24 14:40:53 +01:00
2020-11-16 21:43:42 -05:00
2020-11-24 09:35:12 -05:00
2020-11-16 11:43:00 -05:00
2020-11-24 14:40:53 +01:00
2020-11-03 16:04:22 +01:00
2020-11-16 11:43:00 -05:00
2020-11-27 16:41:29 -05:00
2020-11-16 11:43:00 -05:00
2020-11-16 21:43:42 -05:00
2020-11-16 11:43:00 -05:00
2020-11-27 16:26:45 +01:00
2020-11-24 14:40:53 +01:00
2020-11-23 13:28:29 +01:00
2020-11-16 21:43:42 -05:00
2020-11-30 13:43:17 -05:00
2020-11-30 13:43:17 -05:00
2020-11-16 21:43:42 -05:00
2020-11-16 11:43:00 -05:00
2020-11-23 14:41:23 -05:00
2020-11-24 14:40:53 +01:00
2020-11-25 22:06:04 +01:00
2020-11-25 22:06:04 +01:00
2020-11-23 12:33:13 -08:00
2020-11-24 09:35:12 -05:00
2020-11-16 11:43:00 -05:00
2020-11-17 12:23:09 +01:00
2020-11-16 11:43:00 -05:00
2020-11-23 12:33:13 -08:00
2020-11-25 22:06:04 +01:00
2020-11-30 08:34:40 +01:00
2020-11-25 22:06:04 +01:00
2020-11-24 14:40:53 +01:00
2020-11-16 11:43:00 -05:00
2020-11-30 08:34:40 +01:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-11-24 13:55:00 -05:00
2020-11-23 18:20:19 -05:00
2020-11-24 13:55:00 -05:00
2020-11-16 11:43:00 -05:00
2020-11-24 13:55:00 -05:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-11-19 10:41:41 -05:00
2020-11-19 10:41:41 -05:00
2020-11-16 11:43:00 -05:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-11-19 10:37:27 -05:00
2020-11-16 21:43:42 -05:00
2020-10-30 11:23:16 -04:00
2020-10-30 11:23:16 -04:00
2020-11-16 21:43:42 -05:00
2020-11-17 12:23:09 +01:00
2020-11-16 21:43:42 -05:00
2020-10-30 11:23:16 -04:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-11-30 08:34:40 +01:00
2020-11-16 11:43:00 -05:00
2020-11-16 11:43:00 -05:00
2020-11-16 11:43:00 -05:00
2020-11-25 16:54:59 -05:00
2020-11-25 22:06:04 +01:00
2020-10-30 10:25:48 -04:00
2020-11-16 11:43:00 -05:00
2020-11-16 21:43:42 -05:00
2020-11-25 16:54:59 -05:00
2020-10-30 10:25:48 -04:00
2020-08-24 11:03:01 -04:00
2020-08-27 18:31:51 +02:00
2020-11-15 22:50:59 +01:00
2020-11-03 10:33:01 -05:00
2020-10-23 15:58:19 +02:00
2020-11-17 15:11:29 -05:00
2020-11-30 14:05:08 -05:00
2020-11-25 14:02:15 -05:00
2020-10-23 15:58:19 +02:00
2020-10-23 15:58:19 +02:00
2020-10-23 15:58:19 +02:00
2020-11-03 10:10:22 -05:00
2020-10-23 15:58:19 +02:00
2020-11-15 22:50:59 +01:00
2020-11-16 21:43:42 -05:00
2020-10-20 04:39:13 -04:00
2020-10-18 20:51:24 +02:00
2020-11-24 09:50:25 -05:00
2020-11-16 21:43:42 -05:00
2020-11-27 12:31:42 -05:00
2020-10-18 20:51:24 +02:00
2020-11-23 11:15:02 -05:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-10-18 20:51:24 +02:00
2020-11-17 14:03:03 -05:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-10-18 20:51:24 +02:00
2020-10-18 20:51:24 +02:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-11-19 16:44:20 -05:00
2020-11-16 21:43:42 -05:00
2020-11-29 16:57:43 +01:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-11-16 21:43:42 -05:00
2020-10-23 10:29:15 -04:00
2020-11-16 21:43:42 -05:00
2020-10-18 20:51:24 +02:00
2020-11-10 14:21:54 -05:00
2020-11-16 21:43:42 -05:00
2020-11-19 12:06:01 -08:00
2020-11-16 21:43:42 -05:00
2020-10-18 20:51:24 +02:00
2020-11-16 21:43:42 -05:00
2020-10-18 20:51:24 +02:00
2020-10-27 10:37:04 -04:00
2020-11-10 13:23:58 -05:00
2020-10-14 11:41:45 -04:00
2020-10-14 11:41:45 -04:00
2020-11-23 17:25:35 -05:00
2020-11-16 21:43:42 -05:00
2020-11-24 13:22:25 -05:00