diff options
| author | Nathan Reiner <nathan@nathanreiner.xyz> | 2023-07-06 11:51:21 +0200 |
|---|---|---|
| committer | Nathan Reiner <nathan@nathanreiner.xyz> | 2023-07-06 11:51:21 +0200 |
| commit | e1770cf3b0fd5eff3e69a8ec28c15018084eae73 (patch) | |
| tree | 0fc6289cd8b56f654a760d1ee7d748d160bcc251 /src/text/pdf.rs | |
| parent | 3ca9adc0c5e138271dacab7691dac77da0ba0f21 (diff) | |
add extractors for docx, pptx, pdf, etc.
Diffstat (limited to 'src/text/pdf.rs')
| -rw-r--r-- | src/text/pdf.rs | 5 |
1 files changed, 5 insertions, 0 deletions
diff --git a/src/text/pdf.rs b/src/text/pdf.rs new file mode 100644 index 0000000..efa441f --- /dev/null +++ b/src/text/pdf.rs @@ -0,0 +1,5 @@ +use crate::extractors::pdf; + +pub fn get_text(path : &str) -> String { + pdf::pdf2text(path).ok().unwrap_or_else(|| "".to_string()) +} |