Skip to content

Glue Crawler

Schedule-driven schema discovery into Glue Data Catalog.

analytics
category
15
settings
2
inputs
1
outputs
SettingTypeRequiredDefault
Crawler nameTextYes
DescriptionText
Target databaseTextYes
Cron scheduleText
IAM role ARNText
Table prefixText
S3 pathsList
JDBC connectionsList
DynamoDB tablesList
Recrawl policy
Options: Crawl everything, Crawl new folders only, Event-driven
ChoiceCRAWL_EVERYTHING
Schema change update behavior
Options: Update in database, Log only
ChoiceUPDATE_IN_DATABASE
Schema change delete behavior
Options: Log only, Delete from database, Deprecate in database
ChoiceDEPRECATE_IN_DATABASE
Custom classifiersList
Configuration (JSON)Text
TagsKey–value
SocketDirectionAcceptsTerraform arg
SourceInputany
IAM roleInputaws.iam-rolerole
Data CatalogOutputany