Automatically detect and redact driver's license numbers from all 50 US states and international formats with intelligent state identification and format validation.
All 50 states plus international formats
Detect driver's license formats from all 50 US states, DC, and US territories with state-specific rules.
Recognize license formats from Canada, UK, EU countries, Australia, and other major nations.
Validate license numbers against state-specific patterns and check digit algorithms.
Automatically identify the issuing state based on format and contextual clues.
Extract and redact license numbers from scanned IDs, photos, and document images.
Support DPPA, state privacy laws, and industry-specific ID protection requirements.
Simple integration, powerful results
Send your documents, text, or files through our secure API endpoint or web interface.
Our AI analyzes content to identify all sensitive information types with 99.7% accuracy.
Sensitive data is automatically redacted based on your configured compliance rules.
Receive your redacted content with full audit trail and compliance documentation.
Get started with just a few lines of code
import requests
api_key = "your_api_key"
url = "https://api.redactionapi.net/v1/redact"
data = {
"text": "John Smith's SSN is 123-45-6789",
"redaction_types": ["ssn", "person_name"],
"output_format": "redacted"
}
response = requests.post(url,
headers={"Authorization": f"Bearer {api_key}"},
json=data
)
print(response.json())
# Output: {"redacted_text": "[PERSON_NAME]'s SSN is [SSN_REDACTED]"}
const axios = require('axios');
const apiKey = 'your_api_key';
const url = 'https://api.redactionapi.net/v1/redact';
const data = {
text: "John Smith's SSN is 123-45-6789",
redaction_types: ["ssn", "person_name"],
output_format: "redacted"
};
axios.post(url, data, {
headers: { 'Authorization': `Bearer ${apiKey}` }
})
.then(response => {
console.log(response.data);
// Output: {"redacted_text": "[PERSON_NAME]'s SSN is [SSN_REDACTED]"}
});
curl -X POST https://api.redactionapi.net/v1/redact \
-H "Authorization: Bearer your_api_key" \
-H "Content-Type: application/json" \
-d '{
"text": "John Smith's SSN is 123-45-6789",
"redaction_types": ["ssn", "person_name"],
"output_format": "redacted"
}'
# Response:
# {"redacted_text": "[PERSON_NAME]'s SSN is [SSN_REDACTED]"}
Driver's licenses serve as the primary form of identification for most Americans, appearing in virtually every identity verification scenario—from opening bank accounts to renting apartments, from employment applications to insurance claims. This ubiquity creates extraordinary exposure risk. Unlike Social Security Numbers, which individuals are increasingly cautious about sharing, driver's license numbers flow freely through commerce, creating countless opportunities for interception and misuse.
The complexity of driver's license detection stems from America's federalist approach to licensing. Each of the 50 states, plus the District of Columbia and US territories, maintains its own license numbering system. Some states use purely numeric formats, others mix letters and numbers. Some have check digits, others don't. Format lengths range from 7 to 14 characters. Effective detection requires comprehensive knowledge of every state's format, combined with intelligent analysis to distinguish license numbers from similar alphanumeric patterns.
Understanding the diversity of state formats illustrates the detection challenge:
California: One letter followed by exactly 7 digits (e.g., A1234567). The first letter typically corresponds to the first letter of the last name but follows specific assignment rules.
New York: Exactly 9 digits with no letters. Format appears purely numeric but follows an internal structure based on soundex encoding of the licensee's name.
Florida: One letter followed by 12 digits, creating one of the longest license numbers. The format includes encoded personal information following a specific algorithm.
Texas: 8 digits for most licenses, though older formats used 7 digits. No letters are included in the standard format.
Illinois: One letter followed by 11 digits. The format encodes demographic information using a specific algorithm.
Pennsylvania: 8 digits with no letters. The format includes a check digit calculated from the other digits.
Ohio: 2 letters followed by 6 digits, creating a distinctive format easily distinguished from other states.
Our detection engine uses a multi-layered approach to identify driver's license numbers accurately:
Format Matching: We maintain detailed specifications for each state's license format, including character types, lengths, position rules, and known prefixes. Incoming text is scanned for patterns matching these specifications.
State Identification: When a pattern matches multiple states' formats, we analyze context to determine the most likely issuing state. State names, abbreviations, addresses, and other geographic indicators help resolve ambiguity.
Check Digit Validation: States that use check digits (calculated from other digits in the number) enable validation. We verify check digits to confirm valid license numbers and reduce false positives.
Soundex Verification: Some states encode names using soundex or similar algorithms. When name information is available in context, we can verify that the license number's encoded portion matches, confirming accuracy.
Context Analysis: Terms like "driver's license," "DL," "license number," "DL#," or state DMV references near a matching pattern strongly indicate license data. We weight these contextual signals in classification decisions.
Physical driver's licenses are frequently photographed or scanned for various purposes—identity verification, record-keeping, compliance documentation. These images contain not just the license number but extensive personal information:
Our image processing extracts all text using OCR, then applies the same detection logic as text documents. Visual redaction can selectively obscure specific regions—blurring the license number while preserving other elements, or redacting the entire document face.
The 2D barcode on modern licenses presents a special challenge and opportunity. While containing redundant data, the barcode can be decoded to verify OCR accuracy and ensure complete data capture. Our processing includes barcode detection and decoding for comprehensive protection.
The Real ID Act of 2005 introduced federal standards for state-issued identification. Real ID compliant licenses include security features and require additional documentation for issuance, but the license number format remains determined by each state. Our detection handles Real ID compliant licenses identically to standard licenses—the compliance affects physical security features, not the number format.
Some states offer Enhanced Driver's Licenses (EDLs) that serve as passport alternatives for certain border crossings. These licenses may include RFID chips and additional data, but the license number format typically matches standard licenses. Our detection covers EDLs through standard format recognition.
Beyond US licenses, our detection covers driver's licenses from major countries worldwide:
Canada: Each province maintains its own format. Ontario uses a letter-number pattern with check digit. Quebec uses a letter followed by 12 digits. British Columbia uses 7 digits. Alberta uses up to 9 digits. Our detection includes all 13 provinces and territories.
United Kingdom: UK licenses use a complex format encoding personal information—name, date of birth, gender—into a 16-character identifier. We decode and validate this format.
European Union: EU directive established common standards, but member states retain distinct numbering systems. German licenses use alphanumeric sequences. French licenses use a 12-digit format with regional coding.
Australia: Each state uses different formats. New South Wales uses a 7-digit format. Victoria uses 8-10 digits. Queensland uses 8-9 digits. We support all Australian states and territories.
International Driving Permits: IDPs follow standardized formatting under the 1968 Vienna Convention on Road Traffic. We detect IDP numbers alongside national licenses.
Driver's license data appears across numerous industries with specific protection requirements:
Financial Services: Banks use license numbers for identity verification when opening accounts or processing applications. This data flows through core banking systems, document management, and compliance databases.
Insurance: Auto insurers collect license information for policy underwriting, claims processing, and fraud detection. License data appears in applications, claims documents, and investigation files.
Healthcare: Medical facilities collect license numbers for patient identification and insurance verification. This data appears in registration systems, medical records, and billing documents.
Retail and Commerce: Age-restricted purchases, check acceptance, and return policies often involve license scanning. This data accumulates in POS systems and transaction logs.
Employment: I-9 verification and background checks involve license documentation. HR systems store this information in employee records and compliance files.
Property Management: Rental applications require identity verification including license information. This data appears in tenant files, lease documents, and screening reports.
Driver's license data protection intersects with multiple regulatory requirements:
Driver's Privacy Protection Act (DPPA): Federal law restricting disclosure of personal information from motor vehicle records. Applies to state DMVs and those who receive data from DMV records.
State Data Protection Laws: Many states include driver's license numbers in their definitions of personal information subject to breach notification and protection requirements.
GLBA (Financial): Financial institutions must protect license numbers as customer personal information under Gramm-Leach-Bliley Act safeguards.
HIPAA (Healthcare): While not a HIPAA identifier per se, license numbers appearing in medical records become protected health information requiring appropriate handling.
Industry Standards: PCI DSS, SOC 2, and other compliance frameworks expect protection of government-issued identifiers including driver's licenses.
Different use cases require different redaction approaches:
Full Redaction: Replace the entire license number with [DL_REDACTED] or similar marker. Appropriate when no reference to the original is needed.
Partial Masking: Show only the last few characters (e.g., ****4567) for verification purposes while protecting the full number.
State Preservation: Redact the number while indicating the state (e.g., [CA_DL_REDACTED]) for demographic purposes.
Tokenization: Replace with a consistent token enabling data linking without exposing the actual number. Essential for analytics while maintaining referential integrity.
License number formats can overlap with other identifiers, requiring careful false positive management:
Numeric Overlap: A purely numeric license (like New York's 9-digit format) might match phone numbers, account numbers, or other numeric sequences. Context analysis distinguishes license numbers from similar patterns.
Alphanumeric Ambiguity: Formats mixing letters and numbers (like California's) can match product codes, serial numbers, or reference identifiers. Format-specific rules and context help differentiate.
Cross-State Overlap: Some state formats overlap with others. Texas's 8-digit format could match Pennsylvania's format. State identification through context resolves these cases.
Confidence Scoring: Our detection provides confidence scores indicating match quality. High-confidence matches with strong context are treated differently from potential matches requiring human review.
RedactionAPI has transformed our document processing workflow. We've reduced manual redaction time by 95% while achieving better accuracy than our previous manual process.
The API integration was seamless. Within a week, we had automated redaction running across all our customer support channels, ensuring GDPR compliance effortlessly.
We process over 50,000 legal documents monthly. RedactionAPI handles it all with incredible accuracy and speed. It's become an essential part of our legal tech stack.
The multi-language support is outstanding. We operate in 30 countries and RedactionAPI handles all our documents regardless of language with consistent accuracy.
Trusted by 500+ enterprises worldwide





Each US state has its own driver's license number format—California uses a letter followed by 7 digits, New York uses 9 digits, Florida uses a letter followed by 12 digits. Our detection engine includes format specifications for all 50 states plus DC and territories, with validation rules specific to each jurisdiction.
In many cases, yes. When the format is unique to a specific state (like Florida's 13-character format), we can identify the issuing state. When formats overlap between states, we use context clues—state names, abbreviations, addresses—to determine the most likely issuer.
Yes, Real ID compliance doesn't change the license number format—it affects the physical card features. Our detection works with both Real ID compliant and non-compliant licenses, focusing on the number format rather than card attributes.
For images of physical licenses, we use OCR to extract text including the license number, name, address, and other fields. Visual redaction can then blur or black-box sensitive regions. We support front and back of license images.
We detect International Driving Permits (IDPs) which follow a standardized format, as well as native licenses from major countries including Canada (provincial formats), UK, Germany, France, Australia (state formats), Japan, and others.
License number formats can overlap with other identifiers. We use format validation, check digit verification where applicable, context analysis (looking for license-related terms), and state identification to distinguish actual license numbers from similar patterns.