surnam.es is a leading global platform dedicated to the analysis and dissemination of surname distribution data. We are committed to providing accurate, up-to-date and accessible information in 16 languages, backed by official sources and a rigorous verification process that ensures the highest reliability of the data presented.
Data Sources
The reliability of our data is built upon the diversity and quality of our sources. We work exclusively with publicly available data from internationally recognized official institutions:
- National statistics offices: ONS (United Kingdom), INSEE (France), ISTAT (Italy), Destatis (Germany), INE (Spain), Statistics Canada, US Census Bureau, ABS (Australia), Statistics New Zealand, CBS (Netherlands), SCB (Sweden), among many others.
- Civil registries and national censuses: Official population data from national censuses, civil registries, municipal rolls and electoral registers, wherever publicly available and permitted for use.
- Regional and local administrations: Public databases from regional, provincial, cantonal and municipal governments that complement and enrich national-level data, providing greater geographic granularity.
- International organizations: Open data from the United Nations, Eurostat, the World Bank, the OECD and other supranational institutions that provide comparative frameworks and contextual demographic data.
Collection and Processing
Every data point published on our platform goes through a structured four-phase process designed to ensure maximum quality and consistency:
- Collection: Systematic extraction of data from official public sources. The most recent and authoritative source available is always prioritized for each country. Data is obtained through direct access to official statistics portals, open file downloads and public data API queries.
- Normalization: Standardization of formats, full Unicode encoding, transliteration of non-Latin scripts (Cyrillic, Arabic, Thai, Hindi, Japanese) and unification of spelling variants to enable coherent cross-country comparisons. Capitalization rules, duplicate removal and numeric format harmonization are applied.
- Cross-verification: Cross-referencing data across multiple independent sources to validate consistency. Outliers or data showing significant discrepancies between sources are reviewed manually. Statistical controls are applied to detect anomalies, transcription errors or incomplete data.
- Publication and continuous review: Validated data is published and subjected to periodic review. Each page includes its last update date for complete transparency. Data is accessible in 16 languages simultaneously through our localization system.
Update Frequency
Data updates are carried out continuously, prioritized by demand and the availability of new data. Countries with the highest query volumes are reviewed most frequently. Each country page displays the exact date of its last revision, allowing users to assess information currency.
When a statistics office publishes new census data or registry updates, our team incorporates them as quickly as possible, typically within days to weeks of official publication.
Geographic and Language Coverage
We currently cover surname data from over 100 countries across five continents, with information accessible in 16 languages: Spanish, English, French, German, Italian, Portuguese, Polish, Dutch, Catalan, Romanian, Swedish, Danish, Hungarian, Czech, Finnish and Turkish.
Our coverage spans from countries with the most comprehensive and detailed statistical records (Western Europe, North America, Oceania) to regions with more limited open data availability (Africa, parts of Asia), where we work with the best sources available and maintain a continuous effort to expand coverage.
Personal Data Handling
It is important to note that surnam.es does not store or publish individual personal data. All information presented consists of aggregate, statistical data: surname frequencies by country, comparative rankings and geographic distributions. Individual persons associated with a surname are never identified.
Quality Commitment
We are committed to full transparency about our sources and methods. Every methodological decision is aimed at maximizing the accuracy and usefulness of data for our users. If you spot any errors, have access to sources that could improve our data or have suggestions about our methodology, we invite you to contact us. Continuous improvement is a fundamental pillar of our project.
Known Limitations
The data presented are approximations based on the best official sources available. Actual figures may vary due to:
- Differences in census periods and update frequencies between countries
- Variable registration criteria and coverage depending on each nation's legislation
- Recent demographic changes not yet reflected in the latest available sources
- Variations in the granularity and detail of data published by each statistics office
- Differences in the transliteration of surnames from non-Latin scripts
When a country provides partial data or has significant limitations, this is clearly indicated on the corresponding page.