This post provides the instructions to use Apache Solr with Python in different ways.
======using Pysolr
Below are two small python snippets that the author of the post used for testing writing to and reading from a new SOLR server.
The script below will attempt to add a document to the SOLR server.
# Using Python 2.X
from __future__ import print_function
import pysolr
# Setup a basic Solr instance. The timeout is optional.
solr = pysolr.Solr('http://some-solr-server.com:8080/solr/', timeout=10)
# How you would index data.
solr.add([
{
"id": "doc_1",
"title": "A very small test document about elmo",
}
])
The snippet below will attempt to search for the document that was just added from the snippet above.
# Using Python 2.X
from __future__ import print_function
import pysolr
# Setup a basic Solr instance. The timeout is optional.
solr = pysolr.Solr('http://some-solr-server.com:8080/solr/', timeout=10)
results = solr.search('elmo')
print("Saw {0} result(s).".format(len(results)))
======GitHub repos
- Pysolr (GitHub repo)
pysolr is a lightweight Python wrapper for Apache Solr. It provides an interface that queries the server and returns results based on the query.
install Pysolr using pip
pip install pysolr
Multicore Index
Simply point the URL to the index core:
# Setup a Solr instance. The timeout is optional.
solr = pysolr.Solr('http://localhost:8983/solr/core_0/', timeout=10)
- SolrClient (GitHub repo, its API document)
SolrClient is a simple python library for Solr; built in python3 with support for latest features of Solr.
References:
- Index documents in python using json
-
Using Python (Apache Solr Reference Guide )