TokyoDev scraper and Japanese tech recruitment data extraction API. Extract English-friendly job listings, salary ranges, remote policies, visa sponsorship, and Japanese language requirements from tokyodev.com with this Apify Actor. Fast Cheerio crawler with sitemap discovery. Free tier included.
Whether you are a global recruiter sourcing English-speaking tech talent in Japan, an HR analyst benchmarking JPY compensation, or a B2B sales rep hunting startup leads in Tokyo, this Actor delivers clean, structured datasets in JSON, CSV, or Excel.
Quick Start · Output Schema · Pricing · FAQ
TokyoDev scraper and Japanese tech recruitment data extraction API. Pull English-friendly tech job listings, JPY salary bounds, remote work policies, visa sponsorship details, and Japanese language requirements from tokyodev.com with this Apify Actor. Fast Cheerio crawler with sitemap discovery. Free tier included.
TokyoDev is a curated job board for software engineers and tech professionals seeking work in Japan. This Actor crawls every job and company page via sitemap.xml, returning structured fields for salary, seniority, location, and language requirements. No browser automation needed. The static HTML crawler runs faster and costs less than Playwright-based alternatives.
- Sitemap discovery parses sitemap.xml for all job and company URLs automatically
- Cheerio static crawler runs 5x faster than headless browser alternatives
- Complete salary data extracts min, max, and currency in JPY for compensation benchmarking
- Language and visa flags captures japanese_required, apply_from_abroad, and remote_policy fields
- Lower cost at $1.99 per 1,000 results compared to competing job board scrapers
- Multiple export formats download results as JSON, CSV, Excel, or XML directly from Apify
Install the Apify client and call the Actor with a single API request.
npm install apify-clientimport { ApifyClient } from 'apify-client';
const client = new ApifyClient({ token: process.env.APIFY_TOKEN });
const input = {
startUrls: ['https://www.tokyodev.com/'],
maxItems: 500,
};
const run = await client.actor('getascraper/tokyo-dev-scraper').call(input);
const { items } = await client.dataset(run.defaultDatasetId).listItems();
console.log(`Extracted ${items.length} job listings`);- Open the TokyoDev Scraper page on Apify Store
- Click Use Actor to open the input console
- Paste your target URL or leave the default
https://www.tokyodev.com/ - Set maxItems to limit results or leave blank for full crawl
- Click Start and wait for the run to complete
- Download your dataset as JSON, CSV, Excel, or XML from the Storage tab
| Field | Description |
|---|---|
| job_title | Position title as listed on the job page |
| company_name | Employer or recruiting company name |
| location | City, prefecture, or remote designation |
| job_type | Full-time, part-time, contract, or internship |
| seniority | Junior, mid-level, senior, lead, or principal |
| remote_policy | On-site, hybrid, fully remote, or flexible |
| japanese_required | Whether Japanese language skills are required |
| apply_from_abroad | Whether the employer accepts overseas applications |
| salary_min | Minimum annual salary in JPY |
| salary_max | Maximum annual salary in JPY |
| salary_currency | Currency code, typically JPY |
| description | Full job description text |
| requirements | Required skills and qualifications list |
| tags | Technology stack and category tags |
| posted_date | Date the listing was published |
| job_url | Direct link to the original job posting |
| scraped_at | ISO 8601 timestamp of when the data was extracted |
{
"job_title": "Senior Backend Engineer",
"company_name": "Mercari",
"location": "Tokyo, Japan",
"job_type": "Full-time",
"seniority": "Senior",
"remote_policy": "Hybrid",
"japanese_required": false,
"apply_from_abroad": true,
"salary_min": 8000000,
"salary_max": 12000000,
"salary_currency": "JPY",
"description": "We are looking for a Senior Backend Engineer to join our payments team...",
"requirements": ["5+ years backend experience", "Go or Ruby", "Docker", "Kubernetes"],
"tags": ["Go", "Ruby", "AWS", "Microservices"],
"posted_date": "2025-06-01",
"job_url": "https://www.tokyodev.com/jobs/mercari-senior-backend-engineer",
"scraped_at": "2025-06-05T10:30:00.000Z"
}$1.99 per 1,000 results. Free tier included.
| Items scraped | Cost (USD) |
|---|---|
| 100 | $0.20 |
| 500 | $1.00 |
| 1,000 | $1.99 |
| 5,000 | $9.95 |
| 10,000 | $19.90 |
Proxy configuration Use Apify residential or datacenter proxies to avoid IP-based rate limiting. The Actor auto-selects the optimal proxy group for tokyodev.com.
Concurrency Adjust max concurrency in the Actor input to balance speed against target server load. Default concurrency is set to a safe value for this domain.
Fast List Mode Enable fast list mode in the Actor settings to skip detailed page parsing and extract only job titles and URLs for rapid cataloging.
https://www.tokyodev.com/(homepage, triggers full sitemap crawl)https://www.tokyodev.com/jobs/*(individual job listing pages)https://www.tokyodev.com/companies/*(company profile pages)https://www.tokyodev.com/sitemap.xml(direct sitemap input)
- Global recruiters source English-speaking tech talent currently working in Japan or open to relocation
- HR analysts benchmark JPY salary ranges across companies, seniority levels, and tech stacks
- B2B sales reps identify startup leads and hiring companies in the Tokyo tech ecosystem
- Immigration consultants filter jobs by visa sponsorship and apply_from_abroad flags for overseas candidates
- ATS integrators feed structured job data into applicant tracking systems for automated matching
How does it bypass bot protections?
TokyoDev uses standard static HTML rendering with no anti-bot middleware. The Cheerio crawler fetches pages directly without triggering CAPTCHAs or JavaScript challenges. No proxy rotation is required for normal usage.
Do I need a login?
No. All job listings on tokyodev.com are publicly accessible. The Actor does not require authentication, API keys, or a TokyoDev account to extract data.
What formats are supported?
Results can be downloaded from Apify Storage as JSON, CSV, Excel, or XML. The Actor also supports direct API access via the Apify REST API for programmatic consumption.
Can I filter by remote work?
Yes. The output includes a remote_policy field with values like on-site, hybrid, fully remote, or flexible. Filter your downloaded dataset by this field to isolate remote-friendly positions.
This is an independent tool and is not affiliated with, endorsed by, or connected to TokyoDev or tokyodev.com. All extracted data is publicly available on the target website. Users are responsible for complying with applicable data protection laws and the target site terms of service.
Report issues or request features on GitHub Issues.
For Actor-specific problems, open a ticket on the Apify Console.
