TokyoDev Scraper | Japan Tech Jobs | Apify Actor

Apify Actor · Node.js · TypeScript · Free Tier · Cheerio

TokyoDev Scraper is an Apify Actor and data extraction API for Japanese tech recruitment data. It extracts English-friendly job listings, salary ranges, remote policies, visa sponsorship, and Japanese language requirements from tokyodev.com, using a fast Cheerio crawler with sitemap discovery. A free tier is included.

Whether you are a global recruiter sourcing English-speaking tech talent in Japan, an HR analyst benchmarking JPY compensation, or a B2B sales rep hunting startup leads in Tokyo, this Actor delivers clean, structured datasets in JSON, CSV, or Excel.

Quick Start · Output Schema · Pricing · FAQ



What is TokyoDev Scraper?

TokyoDev is a curated job board for software engineers and tech professionals seeking work in Japan. This Actor crawls every job and company page via sitemap.xml, returning structured fields for salary, seniority, location, and language requirements. No browser automation needed. The static HTML crawler runs faster and costs less than Playwright-based alternatives.
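
The sitemap-driven discovery described above can be sketched as follows. This is a minimal illustration and not the Actor's actual code: the helper name `extractSitemapUrls`, the sample XML, and the assumption that job and company pages live under `/jobs/` and `/companies/` are all hypothetical.

```typescript
// Hypothetical helper: pull every <loc> URL out of a sitemap XML string and
// keep only job and company pages. A sketch, not the Actor's implementation.
function extractSitemapUrls(xml: string): string[] {
  const locPattern = /<loc>\s*([^<\s]+)\s*<\/loc>/g;
  // Assumed URL layout: /jobs/<slug> and /companies/<slug>.
  const wanted = /^https:\/\/www\.tokyodev\.com\/(jobs|companies)\/[^\/]+/;
  const urls: string[] = [];
  let match: RegExpExecArray | null;
  while ((match = locPattern.exec(xml)) !== null) {
    const url = match[1];
    // Skip unrelated pages and duplicates.
    if (wanted.test(url) && urls.indexOf(url) === -1) {
      urls.push(url);
    }
  }
  return urls;
}

// Made-up sample data, for illustration only.
const sampleSitemap = `<?xml version="1.0" encoding="UTF-8"?>
<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://www.tokyodev.com/</loc></url>
  <url><loc>https://www.tokyodev.com/jobs/example-backend-engineer</loc></url>
  <url><loc>https://www.tokyodev.com/companies/example-co</loc></url>
  <url><loc>https://www.tokyodev.com/jobs/example-backend-engineer</loc></url>
</urlset>`;

const discovered = extractSitemapUrls(sampleSitemap);
```

A regular expression is enough for a flat `<urlset>` like this sample; a real crawler would more likely use an XML parser and handle nested sitemap index files.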

Why use TokyoDev Scraper?

  • Sitemap discovery: parses sitemap.xml to find all job and company URLs automatically
  • Cheerio static crawler: runs 5x faster than headless browser alternatives
  • Complete salary data: extracts minimum, maximum, and currency (JPY) for compensation benchmarking
  • Language and visa flags: captures the japanese_required, apply_from_abroad, and remote_policy fields
  • Lower cost: $1.99 per 1,000 results, less than competing job board scrapers
  • Multiple export formats: download results as JSON, CSV, Excel, or XML directly from Apify

Quick Start

Install the Apify client and call the Actor with a single API request.

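npm install apify-client

import { ApifyClient } from 'apify-client';

// Authenticate with the API token stored in the APIFY_TOKEN environment variable.
const client = new ApifyClient({ token: process.env.APIFY_TOKEN });

// Start from the homepage (full sitemap crawl) and cap the run at 500 items.
const input = {
  startUrls: ['https://www.tokyodev.com/'],
  maxItems: 500,
};

// Run the Actor, wait for it to finish, then read the results from its default dataset.
// Top-level await requires an ES module context.
const run = await client.actor('getascraper/tokyo-dev-scraper').call(input);
const { items } = await client.dataset(run.defaultDatasetId).listItems();

console.log(`Extracted ${items.length} job listings`);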

How to use

  1. Open the TokyoDev Scraper page on Apify Store
  2. Click Use Actor to open the input console
  3. Paste your target URL or leave the default https://www.tokyodev.com/
  4. Set maxItems to limit results or leave blank for full crawl
  5. Click Start and wait for the run to complete
  6. Download your dataset as JSON, CSV, Excel, or XML from the Storage tab

What data does it extract?

| Field | Description |
| --- | --- |
| job_title | Position title as listed on the job page |
| company_name | Employer or recruiting company name |
| location | City, prefecture, or remote designation |
| job_type | Full-time, part-time, contract, or internship |
| seniority | Junior, mid-level, senior, lead, or principal |
| remote_policy | On-site, hybrid, fully remote, or flexible |
| japanese_required | Whether Japanese language skills are required |
| apply_from_abroad | Whether the employer accepts overseas applications |
| salary_min | Minimum annual salary in JPY |
| salary_max | Maximum annual salary in JPY |
| salary_currency | Currency code, typically JPY |
| description | Full job description text |
| requirements | Required skills and qualifications list |
| tags | Technology stack and category tags |
| posted_date | Date the listing was published |
| job_url | Direct link to the original job posting |
| scraped_at | ISO 8601 timestamp of when the data was extracted |
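
For TypeScript consumers, the fields above could be modelled roughly as shown here. This is a sketch under assumptions: the interface name `TokyoDevJob`, the helper `missingFields`, and the choice to treat the salary fields as nullable are illustrative, since this README does not say which fields may be absent on a given listing.

```typescript
// Assumed shape of one dataset item, derived from the field list above.
interface TokyoDevJob {
  job_title: string;
  company_name: string;
  location: string;
  job_type: string;
  seniority: string;
  remote_policy: string;
  japanese_required: boolean;
  apply_from_abroad: boolean;
  salary_min: number | null; // nullability is an assumption
  salary_max: number | null; // nullability is an assumption
  salary_currency: string | null; // nullability is an assumption
  description: string;
  requirements: string[];
  tags: string[];
  posted_date: string;
  job_url: string;
  scraped_at: string;
}

const EXPECTED_FIELDS: Array<keyof TokyoDevJob> = [
  'job_title', 'company_name', 'location', 'job_type', 'seniority',
  'remote_policy', 'japanese_required', 'apply_from_abroad', 'salary_min',
  'salary_max', 'salary_currency', 'description', 'requirements', 'tags',
  'posted_date', 'job_url', 'scraped_at',
];

// Hypothetical helper: list the documented fields that a raw record lacks.
function missingFields(record: { [key: string]: unknown }): Array<keyof TokyoDevJob> {
  return EXPECTED_FIELDS.filter((field) => !(field in record));
}

// A deliberately incomplete record, for illustration only.
const partialRecord = {
  job_title: 'Senior Backend Engineer',
  company_name: 'Mercari',
  job_url: 'https://www.tokyodev.com/jobs/mercari-senior-backend-engineer',
};
const missing = missingFields(partialRecord);
```

A check like this is useful before loading a dataset into a typed pipeline, because it surfaces schema drift early instead of failing later on an undefined field.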

Output Example

{
  "job_title": "Senior Backend Engineer",
  "company_name": "Mercari",
  "location": "Tokyo, Japan",
  "job_type": "Full-time",
  "seniority": "Senior",
  "remote_policy": "Hybrid",
  "japanese_required": false,
  "apply_from_abroad": true,
  "salary_min": 8000000,
  "salary_max": 12000000,
  "salary_currency": "JPY",
  "description": "We are looking for a Senior Backend Engineer to join our payments team...",
  "requirements": ["5+ years backend experience", "Go or Ruby", "Docker", "Kubernetes"],
  "tags": ["Go", "Ruby", "AWS", "Microservices"],
  "posted_date": "2025-06-01",
  "job_url": "https://www.tokyodev.com/jobs/mercari-senior-backend-engineer",
  "scraped_at": "2025-06-05T10:30:00.000Z"
}
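
Records shaped like the example above lend themselves to simple compensation benchmarking. The sketch below averages the midpoint of each salary range per seniority level. The function name `averageMidpointBySeniority`, the sample figures (apart from the first range, which comes from the example above), and the decision to skip records with a missing bound are assumptions for illustration.

```typescript
interface SalaryRecord {
  seniority: string;
  salary_min: number | null;
  salary_max: number | null;
}

// Average of the salary-range midpoints, grouped by seniority.
// Records without both bounds are skipped (an assumption of this sketch).
function averageMidpointBySeniority(jobs: SalaryRecord[]): { [seniority: string]: number } {
  const totals: { [seniority: string]: { sum: number; count: number } } = Object.create(null);
  for (const job of jobs) {
    if (job.salary_min === null || job.salary_max === null) continue;
    const midpoint = (job.salary_min + job.salary_max) / 2;
    const bucket = totals[job.seniority] || (totals[job.seniority] = { sum: 0, count: 0 });
    bucket.sum += midpoint;
    bucket.count += 1;
  }
  const averages: { [seniority: string]: number } = {};
  for (const level of Object.keys(totals)) {
    averages[level] = totals[level].sum / totals[level].count;
  }
  return averages;
}

// Made-up sample data; only the first range matches the example record.
const sampleJobs: SalaryRecord[] = [
  { seniority: 'Senior', salary_min: 8000000, salary_max: 12000000 },
  { seniority: 'Senior', salary_min: 10000000, salary_max: 14000000 },
  { seniority: 'Junior', salary_min: 4000000, salary_max: 6000000 },
  { seniority: 'Junior', salary_min: null, salary_max: null },
];
const averages = averageMidpointBySeniority(sampleJobs);
```

For skewed salary distributions a median is usually a better summary than a mean; the mean is used here only to keep the sketch short.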

Pricing

$1.99 per 1,000 results. Free tier included.

| Items scraped | Cost (USD) |
| --- | --- |
| 100 | $0.20 |
| 500 | $1.00 |
| 1,000 | $1.99 |
| 5,000 | $9.95 |
| 10,000 | $19.90 |
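
The figures above follow directly from the $1.99 per 1,000 results rate, rounded to the nearest cent. A small estimator, assuming simple linear pricing and ignoring the free tier (whose limits this README does not specify), might look like this; `estimateCostUsd` is a hypothetical helper, and actual billing on Apify may round differently.

```typescript
// Price from this README, expressed in cents to avoid floating-point drift.
const PRICE_PER_1000_RESULTS_CENTS = 199;

// Estimated cost in USD for a given number of results, rounded to the cent.
function estimateCostUsd(items: number): number {
  if (!(items >= 0)) {
    throw new RangeError('items must be a non-negative number');
  }
  const cents = Math.round((items * PRICE_PER_1000_RESULTS_CENTS) / 1000);
  return cents / 100;
}

const costFor500 = estimateCostUsd(500);
const costFor5000 = estimateCostUsd(5000);
```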

Advanced Options

Proxy configuration: Use Apify residential or datacenter proxies to avoid IP-based rate limiting. The Actor auto-selects the optimal proxy group for tokyodev.com.

Concurrency: Adjust max concurrency in the Actor input to balance speed against target server load. Default concurrency is set to a safe value for this domain.

Fast List Mode: Enable fast list mode in the Actor settings to skip detailed page parsing and extract only job titles and URLs for rapid cataloging.

Supported URL types

  • https://www.tokyodev.com/ (homepage, triggers full sitemap crawl)
  • https://www.tokyodev.com/jobs/* (individual job listing pages)
  • https://www.tokyodev.com/companies/* (company profile pages)
  • https://www.tokyodev.com/sitemap.xml (direct sitemap input)
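
To check an input URL against the four patterns above before starting a run, a classifier along these lines could be used. It is a sketch only: `classifyInputUrl` and the `InputUrlKind` labels are hypothetical names, and the Actor's own validation rules may differ.

```typescript
type InputUrlKind = 'homepage' | 'job' | 'company' | 'sitemap' | 'unsupported';

// Map a URL onto one of the supported URL types listed above.
function classifyInputUrl(input: string): InputUrlKind {
  let url: URL;
  try {
    url = new URL(input);
  } catch {
    return 'unsupported'; // not a parseable absolute URL
  }
  if (url.protocol !== 'https:' || url.hostname !== 'www.tokyodev.com') {
    return 'unsupported';
  }
  const path = url.pathname;
  if (path === '/') return 'homepage';
  if (path === '/sitemap.xml') return 'sitemap';
  if (/^\/jobs\/[^\/]+/.test(path)) return 'job';
  if (/^\/companies\/[^\/]+/.test(path)) return 'company';
  return 'unsupported';
}

const kind = classifyInputUrl('https://www.tokyodev.com/jobs/mercari-senior-backend-engineer');
```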

Use cases

  • Global recruiters source English-speaking tech talent currently working in Japan or open to relocation
  • HR analysts benchmark JPY salary ranges across companies, seniority levels, and tech stacks
  • B2B sales reps identify startup leads and hiring companies in the Tokyo tech ecosystem
  • Immigration consultants filter jobs by visa sponsorship and apply_from_abroad flags for overseas candidates
  • ATS integrators feed structured job data into applicant tracking systems for automated matching

FAQ

How does it bypass bot protections?

TokyoDev uses standard static HTML rendering with no anti-bot middleware. The Cheerio crawler fetches pages directly without triggering CAPTCHAs or JavaScript challenges. No proxy rotation is required for normal usage.

Do I need a login?

No. All job listings on tokyodev.com are publicly accessible. The Actor does not require authentication, API keys, or a TokyoDev account to extract data.

What formats are supported?

Results can be downloaded from Apify Storage as JSON, CSV, Excel, or XML. The Actor also supports direct API access via the Apify REST API for programmatic consumption.
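
For programmatic access, the Apify REST API exposes dataset items at `/v2/datasets/{datasetId}/items` and accepts a `format` query parameter. The sketch below only builds the request URL; `datasetItemsUrl` is a hypothetical helper, and the dataset ID shown is a placeholder.

```typescript
// Export formats named in this README, using the API's format identifiers
// (Excel is requested as "xlsx").
type ExportFormat = 'json' | 'csv' | 'xlsx' | 'xml';

// Build the dataset items URL for a given dataset and export format.
function datasetItemsUrl(datasetId: string, format: ExportFormat): string {
  const url = new URL(
    `https://api.apify.com/v2/datasets/${encodeURIComponent(datasetId)}/items`,
  );
  url.searchParams.set('format', format);
  return url.toString();
}

// "abc123" is a placeholder; use run.defaultDatasetId from an actual run.
const csvUrl = datasetItemsUrl('abc123', 'csv');
```

Requests to this URL are typically authenticated by sending the API token in an `Authorization: Bearer <token>` header rather than embedding it in the URL.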

Can I filter by remote work?

Yes. The output includes a remote_policy field with values like on-site, hybrid, fully remote, or flexible. Filter your downloaded dataset by this field to isolate remote-friendly positions.
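
Filtering on remote_policy could look like the sketch below. The comparison is case-insensitive because the output example in this README shows "Hybrid" while the values listed above are lowercase. The helper name `filterRemoteFriendly` and the default choice of which policies count as remote-friendly are assumptions.

```typescript
// Keep only jobs whose remote_policy is in the accepted list.
// Default list is an assumption about what "remote-friendly" means.
function filterRemoteFriendly<T extends { remote_policy: string }>(
  jobs: T[],
  accepted: string[] = ['fully remote', 'hybrid', 'flexible'],
): T[] {
  const normalized = accepted.map((policy) => policy.toLowerCase());
  return jobs.filter(
    (job) => normalized.indexOf(job.remote_policy.trim().toLowerCase()) !== -1,
  );
}

// Made-up sample data, for illustration only.
const sampleListings = [
  { job_title: 'Senior Backend Engineer', remote_policy: 'Hybrid' },
  { job_title: 'Site Reliability Engineer', remote_policy: 'On-site' },
  { job_title: 'Frontend Engineer', remote_policy: 'fully remote' },
];

const remoteFriendly = filterRemoteFriendly(sampleListings);
const fullyRemoteOnly = filterRemoteFriendly(sampleListings, ['fully remote']);
```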

Disclaimers

This is an independent tool and is not affiliated with, endorsed by, or connected to TokyoDev or tokyodev.com. All extracted data is publicly available on the target website. Users are responsible for complying with applicable data protection laws and the target site terms of service.

Support

Report issues or request features on GitHub Issues.

For Actor-specific problems, open a ticket on the Apify Console.
