nthreads/elasticsearch-ingest-email-extract

Elasticsearch email-extract Ingest Processor

This ingest processor extracts email addresses from a text field and stores them as an array in a separate field.

Usage

PUT _ingest/pipeline/email-extract-pipeline
{
  "description": "Ingest processor that extracts emails and stores them in an array in a separate field",
  "processors": [
    {
      "email_extract" : {
        "field" : "my_field"
      }
    }
  ]
}

PUT /my-index/my-type/1?pipeline=email-extract-pipeline
{
  "my_field" : "You can reach out to me at nauman@csms.ae or at nauman.zubair@gmail.com or commenting on www.nthreads.com"
}

GET /my-index/my-type/1
{
  "my_field" : "Some content",
  "potentially_enriched_field": "potentially_enriched_value"
}
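
To make the expected result concrete, here is a minimal Python sketch of the kind of extraction the processor performs on the sample document above. It is not the plugin's implementation: the regular expression and the `extract_emails` helper are assumptions chosen for illustration, and the name of the field the plugin writes to is not shown here because this README does not specify it.

```python
import re

# Hypothetical pattern: a pragmatic approximation of an email address.
# The plugin's actual matching rule may differ.
EMAIL_PATTERN = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def extract_emails(text):
    """Return every substring of `text` that looks like an email address, in order."""
    return EMAIL_PATTERN.findall(text)


# Same document body as in the indexing request above.
doc = {
    "my_field": "You can reach out to me at nauman@csms.ae or at "
                "nauman.zubair@gmail.com or commenting on www.nthreads.com"
}

emails = extract_emails(doc["my_field"])
print(emails)  # ['nauman@csms.ae', 'nauman.zubair@gmail.com']
```

The bare URL `www.nthreads.com` is not matched because it contains no `@`, which is consistent with link extraction still being listed as a TODO.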

Configuration

Parameter       Use
some.setting    Configure x
other.setting   Configure y

Setup

To install this plugin, first build a zip distribution by running:

gradle clean check

This will produce a zip file in build/distributions.

After building the zip file, install it with:

bin/elasticsearch-plugin install file:///path/to/ingest-email-extract/build/distributions/ingest-email-extract-0.0.1-SNAPSHOT.zip

Bugs & TODO

  • Add link extraction
