Solr 7

Solr 7 7.5.0

No permission to download
  • Versions of Major Components (5)
    1. Apache Tika 1.18
    2. Carrot2 3.16.0
    3. Velocity 1.7 and Velocity Tools 2.0
    4. Apache ZooKeeper 3.4.11
    5. Jetty 9.4.11.v20180605
  • Upgrade Notes (5)
    1. The data-driven schema no longer returns the *_str field-copy of text fields by default. The *_str field is still created and can be used for e.g. sorting, but to retrieve it you now need to explicitly list it in "fl" (SOLR-12350)

    2. LUCENE-7976: TieredMergePolicy now respects maxSegmentSizeMB by default when executing findForcedMerges (optimize) and findForcedDeletesMerges (expungeDeletes)
      (Erick Erickson)
    3. SOLR-12395: SignificantTermsQParserPlugin's name is now 'significantTerms' and its old name 'sigificantTerms' is deprecated.

    4. SOLR-11694: Extremely outdated UIMA contrib module has been removed
      (Alexandre Rafalovitch)
    5. SOLR-12008: The configuration file for log4j2.xml is now in ../server/resources/log4j2.xml. All other copies have been removed. WINDOWS USERS: This JIRA corrects a bug in the start scripts that put example logs under ...\server, solr.log is now under ...\example.
      (Erick Erickson)
  • New Features (42)
    1. SOLR-11865: The QueryElevationComponent now has a useConfiguredElevatedOrder setting. When multiple docs are elevated, this specifies whether their relative order should be the order in the configuration file or if not then should they be subject to whatever the sort criteria is. Additionally, QEC was extensively refactored to be more extensible.
      (Bruno Roustant, David Smiley)
    2. SOLR-12474: Add an UpdateRequest Object that implements RequestWriter.ContentWriter
      (noble)
    3. SOLR-12361: Allow nested child documents to be in field values of a SolrInputDocument as an alternative to add/get ChildDocuments off to the side. The latter is now referred to as "anonymous" child documents as opposed to "labelled" (by the field name). Anonymous child docs might be deprecated in the future. This is an internal change that should work for javabin/SolrJ; separate issues will address XML & JSON formats populating nested docs in this way. AddUpdateCommand and it's relationship with DirectUpdateHandler2 was reworked substantially.
      (Moshe Bla, David Smiley)
    4. SOLR-12362: Uploading docs in JSON now supports child documents as field values, thus providing a label to the relationship instead of the current "anonymous" relationship. Use of this experimental feature sometimes requires a anonChildDocs=false parameter until Solr 8 due to syntax ambiguities.
      (Moshe Bla, David Smiley)
    5. SOLR-12485: Uploading docs in XML now supports child documents as field values, thus providing a label to the relationship instead of the current "anonymous" relationship.
      (Moshe Bla, David Smiley)
    6. SOLR-12441: (EXPERIMENTAL) New NestedUpdateProcessorFactory (URP) to populate special fields _nest_parent_ and _nest_path_ of nested (child) documents. It will generate a uniqueKey of nested docs if they were blank too.
      (Moshe Bla, David Smiley)
    7. SOLR-12519: The [child] transformer now returns a nested child doc structure (attached as fields if provided this way) provided the schema has the _nest_path_ field. This is part of a broader enhancement of nested docs.
      (Moshe Bla, David Smiley)
    8. SOLR-12722: The [child] transformer now takes an 'fl' param to specify which fields to return. It will evaluate doc transformers if present. In 7.5 a missing 'fl' defaults to the current behavior of all fields, but in 8.0 defaults to the top/request "fl".
      (Moshe Bla, David Smiley)
    9. SOLR-11578: Solr 7 Admin UI (Cloud > Graph) should reflect the Replica type to give a more accurate representation of the cluster.
      (Rhoit Singh via Erick Erickson)
    10. SOLR-12507: Modify collection API should support un-setting properties.
      (shalin)
    11. SOLR-12506: Add SolrJ support for the modify collection API.
      (shalin)
    12. SOLR-12398: The JSON Facet API now supports type=heatmap facets, just as classic faceting does.
      (David Smiley)
    13. SOLR-11985: Support percentage values in replica attribute in autoscaling policy
      (noble)
    14. SOLR-12511: Support non integer values for replica in autoscaling policy
      (noble)
    15. SOLR-12517: Support range values for replica in autoscaling policy
      (noble)
    16. SOLR-12530: Ability to disable configset upload via -Dconfigset.upload.enabled=false startup parameter
      (Ishan Chattopadhyaya)
    17. SOLR-12495: An #EQUAL function for replica in autoscaling policy to equally distribute replicas
      (noble)
    18. SOLR-11986: Allow percentage in freedisk attribute in autoscaling policy rules
      (noble)
    19. SOLR-12522: Support a runtime function `#ALL` for 'replica' in autoscaling policies
      (noble)
    20. SOLR-12567: JSON Facet "functions" now support an extended "type:func" syntax, similar to other types of facets. This also allows additional local params to be specified for if the aggregation function can take advantage of them.
      (hossman)
    21. SOLR-12581: the JSON Facet 'relatedness()' aggregate function now supports a 'min_popularity' option using the extended type:func syntax
      (hossman)
    22. SOLR-12536: autoscaling policy support to equally distribute replicas on the basis of arbitrary properties
      (noble)
    23. SOLR-8207: Add "Nodes" view to the Admin UI "Cloud" tab, listing nodes and key metrics
      (janhoy)
    24. SOLR-7767: "ZK Status" sub menu under "Cloud" tab to see status of zookeeper ensemble
      (janhoy)
    25. SOLR-11990: Make it possible to co-locate replicas of multiple collections together in a node. A collection may be co-located with another collection during collection creation time by specifying a 'withCollection' parameter. It can also be co-located afterwards by using the modify collection API. The co-location guarantee is enforced regardless of future cluster operations whether they are invoked manually via the Collection API or by the Autoscaling framework.
      (noble, shalin)
    26. SOLR-12402: Factor out SolrDefaultStreamFactory class.
      (Christine Poerschke)
    27. SOLR-12592: support #EQUAL function, range operator, decimal and percentage in cores in autoscaling policies
      (noble)
    28. SOLR-12655: Add Korean morphological analyzer ("nori") to default distribution. This also adds examples for configuration in Solr's schema.
      (Uwe Schindler)
    29. SOLR-11863: Add knnRegress Stream Evaluator to support nearest neighbor regression
      (Joel Bernstein)
    30. SOLR-12702: Add zscores Stream Evaluator
      (Joel Bernstein)
    31. SOLR-12687: Add functions to cache data structures and mathematical models
      (Joel Bernstein)
    32. SOLR-12671: Add robust flag to knnRegress Stream Evaluator
      (Joel Bernstein)
    33. SOLR-12660: Add outliers Stream Evaluator to support outlier detection with probability distributions
      (Joel Bernstein)
    34. SOLR-12634: Add gaussfit Stream Evaluator
      (Joel Bernstein)
    35. SOLR-12629: The predict evaluator should work with the polyfit function
      (Joel Bernstein)
    36. SOLR-12715: NodeAddedTrigger should support adding replicas to new nodes by setting preferredOperation=addreplica.
      (shalin)
    37. SOLR-11861: When creating a configSet via the API, the "baseConfigSet" parameter now defaults to "_default".
      (Amrit Sarkar, David Smiley)
    38. SOLR-12716: NodeLostTrigger should support deleting replicas from lost nodes by setting preferredOperation=deletenode.
      (shalin)
    39. SOLR-9418: Added a new (experimental) PhrasesIdentificationComponent for identifying potential phrases in query input based on overlapping shingles in the index.
      (Akash Mehta, Trey Grainger, hossman)
    40. SOLR-11943: Add machine learning functions for location data
      (Joel Bernstein)
    41. SOLR-12612: Cluster properties restriction of known keys only is relaxed, and now unknown properties starting with "ext." will be allowed. This allows custom to plugins set their own cluster properties.
      (Jeffery Yuan via Tomás Fernández Löbbe)
    42. SOLR-12357: Time Routed Aliases now have a preemptiveCreateMath option to preemptively and asynchronously create the next collection in advance as new data gets within this time window of the end.
      (Gus Heck, David Smiley)
  • Bug Fixes (43)
    1. SOLR-12449: Response /autoscaling/diagnostics shows improper json
      (noble)
    2. SOLR-11676: Keep nrtReplicas and replicationFactor in sync while creating a collection and modifying a collection
      (Varun Thacker)
    3. SOLR-12489: User specified replicationFactor and maxShardsPerNode is used when specified during a restore operation. A user can now specify nrtReplicas/tlogReplicas/pullReplicas while restoring the collection. Specifying replicationFactor or nrtReplicas have the same effect and only one can be specified
      (Varun Thacker)
    4. SOLR-11216: Race condition in PeerSync
      (Cao Manh Dat)
    5. SOLR-11807: Restoring collection now treats maxShardsPerNode=-1 as unlimited
      (Varun Thacker)
    6. SOLR-12413: If Zookeeper was pre-loaded with data before first-use, then the aliases information would be ignored.
      (David Smiley, Gaël Jourdan, Gus Heck)
    7. SOLR-12482: Config API returns status 0 for failed operations.
      (Steve Rowe)
    8. SOLR-12513: Reproducing TestCodecSupport.testMixedCompressionMode failure
      (Erick Erickson)
    9. SOLR-11665: Improve error handling of shard splitting. Fix splitting of mixed replica types.
      (ab)
    10. SOLR-12326: JSON Facet API: terms facet shard requests now indicate if they have more buckets to prevent unnecessary refinement requests.
      (yonk)
    11. SOLR-12427: Improve error message for invalid 'start', 'rows' parameters.
      (Munendra S N via Jason Gerlowski)
    12. SOLR-12395: Make 'significantTerms' the SignificantTermsQParserPlugin's name and deprecate its old 'sigificantTerms' name.
      (Tobias Kässmann, Christine Poerschke)
    13. SOLR-12533 Collection collection fails if metrics are called during core creation
      (Peter Cseh, Mano Kovacs)
    14. SOLR-2834: Fix SolrJ Field and Document analyzes for types that include CharacterFilter
      (Alexandre Rafalovitch)
    15. SOLR-12516: Fix some bugs in 'type:range' Facet refinement when sub-facets are combined with non default values for the 'other' and 'include' options.
      (hossman)
    16. SOLR-12343: Fixed a bug in JSON Faceting that could cause incorrect counts/stats when using non default sort options. This also adds a new configurable "overrefine" option.
      (Yonik Seeley, hossman)
    17. SOLR-12553: Allow SignificantTerms Query Parser to use local parameters
      (Alexandre Rafalovitch)
    18. SOLR-12570: OpenNLPExtractNamedEntitiesUpdateProcessor cannot support multi fields because pattern replacement doesn't work correctly.
      (Koji Sekiguchi)
    19. SOLR-12576: Update ref guide for additional information displayed in cloud view
      (Erick Erickson)
    20. SOLR-12597: Migrate API should fail requests that do not specify split.key parameter
      (shalin)
    21. SOLR-12477: An update would return a client error(400) if it hit a AlreadyClosedException. We now return the error as a server error(500) instead
      (Jeffery via Varun Thacker)
    22. SOLR-12606: Fix InfixSuggestersTest.testShutdownDuringBuild() failures.
      (Steve Rowe)
    23. SOLR-12607: Fixed two separate bugs in shard splits which can cause data loss. The first case is when using TLOG replicas only, the updates forwarded from parent shard leader to the sub-shard leader are written only in tlog and not the index. If this happens after the buffered updates have been replayed then the updates can never be executed even though they remain the transaction log. The second case is when synchronously forwarding updates to sub-shard leader fails and the underlying errors are not propagated to the client.
      (Cao Manh Dat, shalin)
    24. SOLR-12344: SolrSlf4jReporter doesn't set MDC context.
      (ab)
    25. SOLR-12594: MetricsHistoryHandler.getOverseerLeader fails when hostname contains hyphen.
      (ab)
    26. SOLR-12615: HashQParserPlugin will no longer throw an NPE if the hash key field is a string when there are documents with empty values. All documents with empty values ( string , numeric ) will be processed by worker=0 This would fix the NPE when using the search stream with partitionKeys.
      (Varun Thacker)
    27. SOLR-11770: NPE in tvrh if no field is specified and document doesn't contain any fields with term vectors

    28. SOLR-12541: Metrics handler throws an error if there are transient cores.
      (ab)
    29. SOLR-12470: Search Rate Trigger multiple bug fixes, improvements and documentation updates.
      (ab)
    30. SOLR-12665: Autoscaling policy not being refreshed due to caching
      (noble)
    31. SOLR-12649: CloudSolrClient retries requests unnecessarily exception from server
      (noble, shalin)
    32. SOLR-12670: RecoveryStrategy logs wrong wait time when retrying recovery.
      (shalin)
    33. SOLR-12668: Autoscaling trigger listeners should be executed in the order of their creation.
      (ab)
    34. SOLR-12475: Fix MaxSizeAutoCommitTest failures
      (Rupa Shankar, Anshum Gupta)
    35. SOLR-12674: RollupStream should not use the HashQueryParser for 1 worker.
      (Varun Thacker)
    36. SOLR-12679: MiniSolrCloudCluster internal jetty list should never have duplicates
      (shalin)
    37. SOLR-12598: Do not fetch non-stored fields
      (Nikolay Khitrin, Erick Erickson)
    38. SOLR-12683: HashQuery will throw an exception if more than 4 partitionKeys is specified. Earlier after the 4th partitionKey the keys would be silently ignored.
      (Varun Thacker)
    39. SOLR-10028: Fix and improvements to SegmentsInfoRequestHandlerTest
      (Christine Poerschke, Tomás Fernández Löbbe)
    40. SOLR-11585: Solr SQL does not work with point numeric fields
      (Joel Bernstein, Kiran Chitturi)
    41. SOLR-12704: Guard AddSchemaFieldsUpdateProcessorFactory against null field names and field values.
      (Steve Rowe, Varun Thacker)
    42. SOLR-12733: SolrMetricReporterTest failure
      (Erick Erickson, David Smiley)
    43. SOLR-12765: Incorrect format of JMX cache stats.
      (Bojan Smid, ab)
  • Optimizations (12)
    1. SOLR-12350: Do not use docValues as stored for _str (copy)fields in _default configset
      (janhoy)
    2. SOLR-12455: Refactor JSON serialization code into SolrJ package
      (noble)
    3. SOLR-11654: Time Routed Alias will now route documents to the ideal shard of a collection, thus avoiding a hop. Usually documents were already routed well but not always.
      (Gus Heck, David Smiley)
    4. SOLR-12305: When a replica is applying updates, some kind of updates can skip buffering for faster recovery.
      (Cao Manh Dat)
    5. SOLR-11598: The export handler does not limit users to 4 sort fields and is now unlimited. However the speed at which we can export is directly proportional to the number of sort fields specified. This change also allows streaming expressions to group by on more than 4 fields.
      (Aroop Ganguly, Amrit Sarkar, Varun Thacker)
    6. SOLR-12509: Improve SplitShardCmd performance and reliability. A new method of splitting has been introduced (splitMethod=link) which uses hard-linking of index files when possible, resulting in significant speedups and reduced CPU / IO load on shard leader.
      (ab)
    7. SOLR-11881: Retry update requests sent by leaders to it's followers
      (Varun Thacker, Mark Miller, Tomás Fernández Löbbe)
    8. SOLR-12616: Optimize Export writer upto 4 sort fields to get better performance. This was removed in SOLR-11598 but brought back in the same version
      (Amrit Sarkar, Varun Thacker)
    9. SOLR-12572: While exporting documents using the export writer, if a field is specified as a sort parameter and also in the fl (field list) parameter, we save on one doc-value lookup. This can bring performance improvements of 15% and upwards depending on how many fields are in common.
      (Amrit Sarkar, Varun Thacker)
    10. SOLR-10697: HttpShardHandler now uses a default of 100k as maxConnections (10k previously) and default maxConnectionsPerHost as 100k (20 previously). They are now consisent with the UpdateShardHandler defaults.
      (Varun Thacker)
    11. SOLR-12723: Reduce object creation in HashBasedRouter.
      (ab)
    12. SOLR-12766: When retrying internal requests, backoff only once for the full batch of retries
      (Tomás Fernández Löbbe)
  • Other Changes (25)
    1. SOLR-12208: Renamed the autoscaling variable 'INDEX.sizeInBytes' to 'INDEX.sizeInGB'
      (noble)
    2. SOLR-12523: Improve error reporting and docs regarding Collection backup feature shared-fs requirement
      (janhoy)
    3. SOLR-12468: Upgrade Jetty to 9.4.11.v20180605
      (Michael Braun, shalin)
    4. SOLR-12527: factor out a test-framework/ConfigRequest class
      (Christine Poerschke)
    5. SOLR-12412: Leader should give up leadership when IndexWriter.tragedy occur
      (Cao Manh Dat, Tomas Fernandez-Lobbe)
    6. SOLR-12551: Upgrade to Tika 1.18
      (Tim Allison via Erick Erickson)
    7. SOLR-12464: Reduce Overseer.close() logging (for non-Overseer leaders)
      (Christine Poerschke)
    8. SOLR-12454: Tweak Overseer leadership transition related logging for easier troubleshooting.
      (Christine Poerschke)
    9. SOLR-12574: Put under a common "significantTerms" bucket all output by SignificantTerms Query Parser
      (Alexandre Rafalovitch)
    10. SOLR-12164: Improve Ref Guide main landing page.
      (Cassandra Targett)
    11. SOLR-10984: Clean up web.xml, removing old redirects and outdated comments
      (Varun Thacker, janhoy)
    12. SOLR-12617: Remove Commons BeanUtils as a dependency
      (Varun Thacker)
    13. SOLR-11766: Move Streaming Expressions section in Ref Guide to be a top-level section.
      (Cassandra Targett)
    14. SOLR-12656: ShardSplitTest should extend AbstractFullDistribZkTestBase instead of BasicDistributedZkTest.
      (shalin)
    15. LUCENE-8456: Upgrade Apache Commons Compress to v1.18
      (Steve Rowe)
    16. SOLR-12014: Cryptic error message when creating a collection with sharding that violates autoscaling policies
      (noble)
    17. SOLR-12680: Fix ClassCastException and AIOOBE in TestSolrConfigHandlerConcurrent.
      (shalin)
    18. SOLR-12675: Make LeaderVoteWaitTimeoutTest more resilient against side effects of test methods.
      (shalin)
    19. SOLR-12130: CdcrReplicationDistributedZkTest is broken into two test classes, CdcrOpsAndBoundariesTest which does not require node restarts and CdcrWithNodesRestartsTest which does. The tests themselves are made faster and more resilient to spurious failures.
      (Varun Thacker, Amrit Sarkar via shalin)
    20. SOLR-12625: Combine SolrDocumentFetcher and RetrieveFieldsOptimizer
      (Erick Erickson)
    21. SOLR-12690: Regularize LoggerFactory declarations
      (Erick Erickson)
    22. SOLR-12590: Improve Solr resource loader coverage in the ref guide.
      (Steve Rowe, Cassandra Targett, Christine Poerschke)
    23. SOLR-12744: Improve logging messages and verbosity around recoveries
      (Cao Manh Dat, Varun Thacker)
    24. SOLR-8742: In HdfsDirectoryTest replace RAMDirectory usages with ByteBuffersDirectory.
      (hossman, Mark Miller, Andrzej Bialecki, Steve Rowe)
    25. SOLR-12771: Improve Autoscaling Policy and Preferences documentation.
      (hossman, Steve Rowe)
  • Versions of Major Components
    1. Apache Tika 1.17
    2. Carrot2 3.15.0
    3. Velocity 1.7 and Velocity Tools 2.0
    4. Apache UIMA 2.3.1
    5. Apache ZooKeeper 3.4.11
    6. Jetty 9.4.8.v20171121
  • Bug Fixes
    1. SOLR-12204: Upgrade commons-fileupload dependency to 1.3.3 to address CVE-2016-1000031.
      (Steve Rowe)
    2. SOLR-12256: Fixed some eventual-consistency issues with collection aliases by using ZooKeeper.sync().
      (David Smiley)
    3. SOLR-12087: Deleting replicas sometimes fails and causes the replicas to exist in the down state
      (Cao Manh Dat)
    4. SOLR-12146: LIR should skip deleted replicas
      (Cao Manh Dat)
    5. SOLR-12066: Cleanup deleted core when node start
      (Cao Manh Dat)
    6. SOLR-12065: A successful restore collection should mark the shard state as active and not buffering
      (Rohit, Varun Thacker)
    7. SOLR-11724: Cdcr bootstrapping should ensure that non-leader replicas should sync with the leader
      (Amrit Sarkar, Varun Thacker)
    8. SOLR-12202: Fix errors in solr-exporter.cmd.
      (Minoru Osuka via koji)
    9. SOLR-12316: Do not allow to use absolute URIs for including other files in solrconfig.xml and schema parsing.
      (Ananthesh, Ishan Chattopadhyaya, Uwe Schindler)
Top