注意:以下翻译的准确性尚未经过验证。这是使用 AIP ↗ 从原始英文文本进行的机器翻译。

外部合并

支持于:批处理

外部合并提供的数据集输入,保留两个数据集中的所有行。当没有满足提供的条件的行时,列将为空。

变换类别: 合并

声明的参数

  • 选择左侧列的条件 - 将测试左侧输入模式中的所有列,以查看它们是否满足此条件。如果满足,列将被选入输出。
    ColumnPredicate
  • 选择右侧列的条件 - 将测试右侧输入模式中的所有列,以查看它们是否满足此条件。如果满足,列将被选入输出。
    ColumnPredicate
  • 合并条件 - 用于合并的条件。
    Expression<Boolean>
  • 左侧数据集 - 用于合并的左侧数据集。
    Table
  • 右侧数据集 - 用于合并的右侧数据集。
    Table
  • 非必填 右侧列的前缀 - 添加到右侧所有列的前缀。
    Literal<字符串>

示例

示例1:基本情况

参数值:

  • 选择左侧列的条件:
    columnNameIsIn(
     columnNames: [tail_number, airline],
    )
  • 选择右侧列的条件:
    columnNameIsIn(
     columnNames: [home_airport],
    )
  • 合并条件:
    equals(
     left: tail_number,
     right: tail_number,
    )
  • 左侧数据集: ri.foundry.main.dataset.left
  • 右侧数据集: ri.foundry.main.dataset.right
  • 右侧列的前缀: null

输入: ri.foundry.main.dataset.left

tail_numberairlinemilesfactor
XB-123foundry air1242
MT-222new airline11235
XB-123foundry airline3355
MT-222new air5654
KK-452new air2221
PA-452new air2122
XB-123foundry airline11342

ri.foundry.main.dataset.right

tail_numberhome_airport
XB-123LHR
MT-222CPH
KK-452JFK
JR-201IAD

输出:

tail_numberairlinehome_airport
XB-123foundry airLHR
MT-222new airlineCPH
XB-123foundry airlineLHR
MT-222new airCPH
KK-452new airJFK
PA-452new airnull
XB-123foundry airlineLHR
JR-201nullIAD

示例2:基本情况

描述: 简单复杂的合并条件。 参数值:

  • 选择左侧列的条件:
    columnNameIsIn(
     columnNames: [tail_number, airline, factor],
    )
  • 选择右侧列的条件:
    columnNameIsIn(
     columnNames: [tail_number, home_airport, factor],
    )
  • 合并条件:
    lessThan(
     left: factor,
     right: factor,
    )
  • 左侧数据集: ri.foundry.main.dataset.left
  • 右侧数据集: ri.foundry.main.dataset.right
  • 右侧列的前缀: right_

输入: ri.foundry.main.dataset.left

tail_numberairlinemilesfactor
XB-123foundry air1242
MT-222new airline11235
XB-123foundry airline3355
MT-222new air5654
KK-452new air2221
PA-452new air2122
XB-123foundry airline11342

ri.foundry.main.dataset.right

tail_numberhome_airportfactor
XB-123LHR2
MT-222CPH1
KK-452JFK10
JR-201IAD4

输出:

tail_numberairlinefactorright_tail_numberright_home_airportright_factor
XB-123foundry air2KK-452JFK10
XB-123foundry air2JR-201IAD4
MT-222new airline5KK-452JFK10
XB-123foundry airline5KK-452JFK10
MT-222new air4KK-452JFK10
KK-452new air1XB-123LHR2
KK-452new air1KK-452JFK10
KK-452new air1JR-201IAD4
PA-452new air2KK-452JFK10
PA-452new air2JR-201IAD4
XB-123foundry airline2KK-452JFK10
XB-123foundry airline2JR-201IAD4
nullnullnullMT-222CPH1

示例3:基本情况

参数值:

  • 选择左侧列的条件:
    columnNameIsIn(
     columnNames: [tail_number, airline, factor],
    )
  • 选择右侧列的条件:
    columnNameIsIn(
     columnNames: [home_airport],
    )
  • 合并条件:
    and(
     conditions: [
    equals(
     left: tail_number,
     right: tail_number,
    ),
    equals(
     left: factor,
     right: factor,
    )],
    )
  • 左侧数据集: ri.foundry.main.dataset.left
  • 右侧数据集: ri.foundry.main.dataset.right
  • 右侧列的前缀: null

输入: ri.foundry.main.dataset.left

tail_numberairlinemilesfactor
XB-123foundry air1242
MT-222new airline11235
XB-123foundry airline3355
MT-222new air5654
KK-452new air2221
PA-452new air2122
XB-123foundry airline11342

ri.foundry.main.dataset.right

tail_numberhome_airportfactor
XB-123LHR2
MT-222CPH1
KK-452JFK10
JR-201IAD4

输出:

tail_numberairlinefactorhome_airport
XB-123foundry air2LHR
MT-222new airline5null
XB-123foundry airline5null
MT-222new air4null
KK-452new air1null
PA-452new air2null
XB-123foundry airline2LHR
JR-201null4IAD
KK-452null10JFK
MT-222null1CPH

示例4:基本情况

描述: 注意,左侧和右侧的tail_number值已合并。 参数值:

  • 选择左侧列的条件:
    allColumns(

    )
  • 选择右侧列的条件:
    columnNameIsIn(
     columnNames: [home_airport],
    )
  • 合并条件:
    equals(
     left: tail_number,
     right: tail_number,
    )
  • 左侧数据集: ri.foundry.main.dataset.left
  • 右侧数据集: ri.foundry.main.dataset.right
  • 右侧列的前缀: null

输入: ri.foundry.main.dataset.left

tail_numberairlinemilesfactor
XB-123foundry air1242
MT-222new airline11235
XB-123foundry airline3355
MT-222new air5654
KK-452new air2221
PA-452new air2122
XB-123foundry airline11342

ri.foundry.main.dataset.right

tail_numberhome_airport
XB-123LHR
MT-222CPH
KK-452JFK
JR-201IAD

输出:

tail_numberairlinemilesfactorhome_airport
XB-123foundry air1242LHR
MT-222new airline11235CPH
XB-123foundry airline3355LHR
MT-222new air5654CPH
KK-452new air2221JFK
PA-452new air2122null
XB-123foundry airline11342LHR
JR-201nullnullnullIAD