@@ -359,26 +359,22 @@ def bulk_delete(
359359 return result
360360
361361
362- def bulk_index (
363- client : OpenSearch , index : str , records : Iterator [dict ], action : str
364- ) -> dict [str , int ]:
362+ def bulk_index (client : OpenSearch , index : str , records : Iterator [dict ]) -> dict [str , int ]:
365363 """Indexes records into an existing index using the streaming bulk helper.
366364
367- This method uses the OpenSearch "index" and "update" operations.
368- - Setting `action` to "index" will either create or update a record.
369- If a record with the same _id exists in the index, it will be updated;
370- if it does not exist, the record will be added as a new document.
371- - Setting `action` to "update" will update a document only if it exists
372- in the index. Otherwise, an error is raised.
365+ This action function uses the OpenSearch "index" action, which is a
366+ combination of create and update: if a record with the same _id exists in the
367+ index, it will be updated. If it does not exist, the record will be indexed as a
368+ new document.
373369
374- If an error occurs during the operation , it will be logged, and the bulk
375- operation will continue until all records have been processed.
370+ If an error occurs during record indexing , it will be logged and bulk indexing will
371+ continue until all records have been processed.
376372
377373 Returns total sums of: records created, records updated, errors, and total records
378374 processed.
379375 """
380376 result = {"created" : 0 , "updated" : 0 , "errors" : 0 , "total" : 0 }
381- actions = helpers .generate_bulk_actions (index , records , action )
377+ actions = helpers .generate_bulk_actions (index , records , "index" )
382378 responses = streaming_bulk (
383379 client ,
384380 actions ,
@@ -404,14 +400,13 @@ def bulk_index(
404400 result ["updated" ] += 1
405401 else :
406402 logger .error (
407- "Something unexpected happened during ingest. "
408- f"Bulk { action } response: %s" ,
403+ "Something unexpected happened during ingest. Bulk index response: %s" ,
409404 json .dumps (response ),
410405 )
411406 result ["errors" ] += 1
412407 result ["total" ] += 1
413408 if result ["total" ] % int (os .getenv ("STATUS_UPDATE_INTERVAL" , "1000" )) == 0 :
414- logger .info ("Status update: %s records processed so far!" , result ["total" ])
409+ logger .info ("Status update: %s records indexed so far!" , result ["total" ])
415410 logger .info ("All records ingested, refreshing index." )
416411 response = client .indices .refresh (
417412 index = index ,
0 commit comments