Building bilingual NER for cargo logistics with Amazon Bedrock
IBS Software’s Cargo system processes thousands of bilingual cargo logistics email messages daily. The system extracts critical information such as air waybill (AWB) numbers, flight details, weights, and delivery instructions in both English and Japanese. This added to the complexity of building a robust Named Entity Recognition (NER) solution. Challenges included manual intervention that slowed operations and a trade-off between accuracy and cost. IBS Software needed an AI solution that could accurately identify 23 different entity types across two languages while remaining cost-effective at scale.
After exploring multiple approaches, IBS Software used managed distillation capabilities of Amazon Bedrock to create a production-ready solution. By distilling knowledge from Amazon Nova Pro into the more efficient Amazon Nova Lite model, IBS Software achieved 95.085 percent F1-Score accuracy while reducing operational costs by 14x. This case study details their journey from facing complex open-source implementations to a successful deployment on AWS that now processes cargo email messages in real time.
In this post, we share the technical approach using token-based distillation, lessons learned, and deployment architecture. If you face similar bilingual NER challenges, you can benefit from IBS Software’s experience with the Amazon Bedrock knowledge distillation capabilities.
The goal was to build a bilingual NER system capable of extracting 23 entity types from cargo logistics email messages written in English and Japanese. The key entities include:
The primary risks included maintaining high accuracy across both languages, managing inference costs at scale, and achieving low latency for real-time processing. With the model distillation capabilities of Amazon Bedrock, you can use smaller, faster, and more cost-effective models. These models deliver accuracy for your use case that is comparable to the most advanced models in Amazon Bedrock.
The following diagram shows the end-to-end bilingual NER workflow on Amazon Bedrock.
Figure 1: End-to-end bilingual NER workflow on Amazon Bedrock
IBS’s team of nine researchers and engineers spent approximately 4 months developing and deploying this solution. The project timeline included:
IBS Software deployed all infrastructure using Amazon Bedrock managed services, which bypassed the need for custom model hosting infrastructure.
Initially, the team attempted knowledge distillation using open-source frameworks including PyTorch-based implementations and the TextBrewer library. These approaches failed because of:
For more details on knowledge distillation fundamentals, see AWS Machine Learning Best Practices.
We pivoted to Amazon Bedrock Model Distillation, using Amazon Nova Pro as the teacher model and Nova Lite as the student model. The key advantages included:
The training process reduced loss from 0.05 to 0.008 over 70 steps, indicating strong knowledge transfer from teacher to student.
For Amazon Bedrock distillation documentation, see Customize a model with distillation in Amazon Bedrock.
Our dataset consisted of 500 real-world cargo logistics email messages:
Each email message was manually annotated for 23 entity types by domain experts familiar with cargo logistics terminology. The annotation process took approximately 3 weeks and supplied high-quality training data for both languages.
We evaluated both teacher and student models using F1-Score, the harmonic mean of precision and recall:
Although the base Nova Lite model offered approximately 84 percent overall F1-Score, the teacher model and the customized Nova Lite model achieved an approximately 10 percent uplift in accuracy. The following table shows the F1-Score results.
The distilled Nova Lite model retained 98 percent of the teacher’s performance while providing 14x cost reduction in production inference.
We observed that the student model showed a 2.565 percent lower F1-Score on Japanese text than on English text. This gap came primarily from complex kanji character combinations in commodity descriptions, ambiguous entity boundaries in Japanese text without spaces, and the smaller volume of Japanese training data (150 compared to 350 email messages). Multi-line delivery instructions with embedded entities also occasionally caused boundary detection errors.
To overcome these challenges, we augmented Japanese training data with synthetic examples. We also applied post-processing rules for known entity patterns (AWB format, flight number regex) and implemented confidence thresholding to flag low-confidence predictions for human review.
Note: The following deployment creates AWS resources that incur charges. Amazon Simple Storage Service (Amazon S3) storage, AWS Lambda invocations, Amazon Bedrock model inference, and Amazon DynamoDB storage all have associated costs. Delete these resources when you no longer need them to avoid ongoing charges.
Our production deployment processes .eml files through the following workflow:
For Lambda integration patterns, see AWS Lambda with Amazon Bedrock.
The entire pipeline processes email messages in under 2 seconds with 95.085 percent accuracy, meeting our real-time processing requirements.
In this post, we showed how IBS Software used Amazon Bedrock managed distillation capabilities to build a cost-effective bilingual NER system for cargo logistics. The system achieves 95.085 percent F1-Score while reducing operational costs by 14x. The distilled Nova Lite model retains 98 percent of the teacher model’s performance, making it ideal for high-volume production workloads.
Our key takeaway was that Amazon Bedrock managed distillation capabilities alleviated the complexity of open-source frameworks. The token-level knowledge distillation preserved accuracy across both English and Japanese, and the 2048-token sequence length accommodated typical cargo email lengths. Production deployment with AWS Lambda and Amazon S3 integration required minimal custom infrastructure.
If you’re facing similar bilingual NER challenges, consider:
For more information about the topics discussed in this post, see the following resources:
If you’re working on bilingual NER or knowledge distillation for your own use case, we’d love to hear about your experience. Share your questions or feedback in the comments.
Related Stories
AI News
England 2026 World Cup news: Reece James and Jarell Quansah to miss last
6 minutes ago
AI News
9 children killed as tricycle plunges into a canal in Egypt
7 minutes ago
AI News
World Cup 2026: Germany fans in need of hope after exit as Jurgen Klopp looms
7 minutes ago
AI News
U.S. Supreme Court upholds Idaho, West Virginia laws banning transgender athletes from women's sports
7 minutes ago
AI News
United States announces roster for 2026 World Junior Summer Showcase
7 minutes ago
AI News
Mayawati terms Ram Temple 'scam' serious, warns against politics
7 minutes ago
AI News
KC Venugopal slams 'vendetta politics' over journalist's passport denial
7 minutes ago
AI News
Ram Mandir 'Chanda Chori': Probe, Politics & Posturing; Congress At Ram's Feet For Votes? WATCH
7 minutes ago