【问题标题】:Parse filename and rename with specific order按特定顺序解析文件名和重命名
【发布时间】:2018-06-20 01:53:30
【问题描述】:

我在一个目录中有许多tiff 文件并且必须重命名,但它们有数百个,因此可能很麻烦。文件看起来像这样:

basefilename_0002_-0.0.mrc
basefilename_0003_3.0.mrc
basefilename_0004_-3.0.mrc
basefilename_0005_-6.0.mrc 
basefilename_0006_6.0.mrc 
etc...

我需要做的就是更改文件名的中间部分,以便前 41 个.mrc 文件将更改为:

basefilename_0001_-0.0.mrc
basefilename_0001_3.0.mrc
basefilename_0001_-3.0.mrc
basefilename_0001_-6.0.mrc 
basefilename_0001_6.0.mrc 
etc.

以及第二批41批.mrc文件:

basefilename_0043_-0.0.mrc
basefilename_0044_3.0.mrc
basefilename_0045_-3.0.mrc
basefilename_0046_-6.0.mrc
basefilename_0047_6.0.mrc

将改名为

basefilename_0002_-0.0.mrc
basefilename_0002_3.0.mrc
basefilename_0002_-3.0.mrc
basefilename_0002_-6.0.mrc 
basefilename_0002_6.0.mrc 
etc.

所以基本上我必须在 "basefilename_" 之后和下一个 "_" 之前解析并重命名,以便数字不会升序,而只是 0001。但我有数百个,我需要确保每 41 个 @ 987654330@ 文件在basefile 名称之后和下一个描述之前的编号相同。

【问题讨论】:

  • 你在寻找一个shell脚本吗?如果有,是什么平台?
  • 是的。在 linux 上,所以可能是 bash 脚本?

标签: bash parsing rename


【解决方案1】:

您实际上可以使用 bash 本身提供的本机工具来做您需要的事情,而无需依赖任何需要生成单独子 shell 的单独实用程序。 Bash 提供了带有子字符串替换的参数扩展,您可以使用它来将_????_ 之间的文本替换为您想要的新文本(例如0001,...)。

Bash 还提供了printf -v var,它允许您使用man 3 print 的所有格式灵活性,同时让您将格式化的输出保存在var 中。因此,例如,如果我有一个值1,我想将其格式化为0001 并存储在变量blkno 中,那么printf -v "%04d" '1' 就是一个简单的问题。

结合一个简单的计数器,然后使用 bash 提供的 modulo 运算符,你可以做你需要的:

#!/bin/bash

ext=${1:-mrc}                           ## extension of files to select
declare -i blksz=${2:-41} cnt=0 blk=1   ## files per-block, counters
printf -v blkno "%04d" $blk             ## format 1st blk as 0001

for i in *.$ext; do                     ## loop over each file with extension
    ## test output showing what would be moved, to new name
    printf "mv %-28s %s\n" "$i" "${i/_*_/_${blkno}_}"
    ## mv "$i" "${i/_*_/_${blkno}_}"    ## (uncomment for actual move)
    (((cnt+1) % blksz == 0)) && {       ## check if blksz output
        ((blk++))                       ## increment blk number
        printf -v blkno "%04d" $blk     ## format as 4-digit w/leading zeros
    }
    ((cnt++))                           ## increment count
done

请注意,脚本将要循环的文件的扩展名(默认为 .mrc)和默认情况下每个块中包含的文件数 41 作为其第一个参数。

输入文件示例

我没有你的确切文件,所以我用循环和touch 生成了类似的东西,例如

basefilename_0002_-0.0.mrc
basefilename_0003_3.0.mrc
basefilename_0004_-3.0.mrc
basefilename_0005_6.0.mrc
basefilename_0006_-6.0.mrc
basefilename_0007_9.0.mrc
basefilename_0008_-9.0.mrc
basefilename_0009_12.0.mrc
basefilename_0010_-12.0.mrc
basefilename_0011_15.0.mrc
basefilename_0012_-15.0.mrc
basefilename_0013_18.0.mrc
basefilename_0014_-18.0.mrc
basefilename_0015_21.0.mrc
basefilename_0016_-21.0.mrc
basefilename_0017_24.0.mrc
basefilename_0018_-24.0.mrc
basefilename_0019_27.0.mrc
basefilename_0020_-27.0.mrc
basefilename_0021_30.0.mrc
basefilename_0022_-30.0.mrc
basefilename_0023_33.0.mrc
basefilename_0024_-33.0.mrc
basefilename_0025_36.0.mrc
basefilename_0026_-36.0.mrc
basefilename_0027_39.0.mrc
basefilename_0028_-39.0.mrc
basefilename_0029_42.0.mrc
basefilename_0030_-42.0.mrc
basefilename_0031_45.0.mrc
basefilename_0032_-45.0.mrc
basefilename_0033_48.0.mrc
basefilename_0034_-48.0.mrc
basefilename_0035_51.0.mrc
basefilename_0036_-51.0.mrc
basefilename_0037_54.0.mrc
basefilename_0038_-54.0.mrc
basefilename_0039_57.0.mrc
basefilename_0040_-57.0.mrc
basefilename_0041_60.0.mrc
basefilename_0042_-60.0.mrc
basefilename_0043_0.0.mrc
basefilename_0044_-0.0.mrc
basefilename_0045_3.0.mrc
basefilename_0046_-3.0.mrc
basefilename_0047_6.0.mrc
basefilename_0048_-6.0.mrc
basefilename_0049_9.0.mrc
basefilename_0050_-9.0.mrc
basefilename_0051_12.0.mrc
basefilename_0052_-12.0.mrc
basefilename_0053_15.0.mrc
basefilename_0054_-15.0.mrc
basefilename_0055_18.0.mrc
basefilename_0056_-18.0.mrc
basefilename_0057_21.0.mrc
basefilename_0058_-21.0.mrc
basefilename_0059_24.0.mrc
basefilename_0060_-24.0.mrc
basefilename_0061_27.0.mrc
basefilename_0062_-27.0.mrc
basefilename_0063_30.0.mrc
basefilename_0064_-30.0.mrc
basefilename_0065_33.0.mrc
basefilename_0066_-33.0.mrc
basefilename_0067_36.0.mrc
basefilename_0068_-36.0.mrc
basefilename_0069_39.0.mrc
basefilename_0070_-39.0.mrc
basefilename_0071_42.0.mrc
basefilename_0072_-42.0.mrc
basefilename_0073_45.0.mrc
basefilename_0074_-45.0.mrc
basefilename_0075_48.0.mrc
basefilename_0076_-48.0.mrc
basefilename_0077_51.0.mrc
basefilename_0078_-51.0.mrc
basefilename_0079_54.0.mrc
basefilename_0080_-54.0.mrc
basefilename_0081_57.0.mrc
basefilename_0082_-57.0.mrc
basefilename_0083_60.0.mrc
basefilename_0084_-60.0.mrc
basefilename_0085_0.0.mrc
basefilename_0086_-0.0.mrc
basefilename_0087_3.0.mrc
basefilename_0088_-3.0.mrc
basefilename_0089_6.0.mrc
basefilename_0090_-6.0.mrc
basefilename_0091_9.0.mrc
basefilename_0092_-9.0.mrc
basefilename_0093_12.0.mrc
basefilename_0094_-12.0.mrc
basefilename_0095_15.0.mrc
basefilename_0096_-15.0.mrc
basefilename_0097_18.0.mrc
basefilename_0098_-18.0.mrc
basefilename_0099_21.0.mrc
basefilename_0100_-21.0.mrc

使用/输出示例

注意:实际移动mv 行已被注释掉,以便您在执行实际移动之前测试脚本并根据需要进行调整。当您满意它按需要执行时,取消注释以 mv 开头的行。

脚本输出文件移动,文件的原始文件名和新文件名,例如

mv basefilename_0002_-0.0.mrc   basefilename_0001_-0.0.mrc
mv basefilename_0003_3.0.mrc    basefilename_0001_3.0.mrc
mv basefilename_0004_-3.0.mrc   basefilename_0001_-3.0.mrc
mv basefilename_0005_6.0.mrc    basefilename_0001_6.0.mrc
mv basefilename_0006_-6.0.mrc   basefilename_0001_-6.0.mrc
mv basefilename_0007_9.0.mrc    basefilename_0001_9.0.mrc
mv basefilename_0008_-9.0.mrc   basefilename_0001_-9.0.mrc
mv basefilename_0009_12.0.mrc   basefilename_0001_12.0.mrc
mv basefilename_0010_-12.0.mrc  basefilename_0001_-12.0.mrc
mv basefilename_0011_15.0.mrc   basefilename_0001_15.0.mrc
mv basefilename_0012_-15.0.mrc  basefilename_0001_-15.0.mrc
mv basefilename_0013_18.0.mrc   basefilename_0001_18.0.mrc
mv basefilename_0014_-18.0.mrc  basefilename_0001_-18.0.mrc
mv basefilename_0015_21.0.mrc   basefilename_0001_21.0.mrc
mv basefilename_0016_-21.0.mrc  basefilename_0001_-21.0.mrc
mv basefilename_0017_24.0.mrc   basefilename_0001_24.0.mrc
mv basefilename_0018_-24.0.mrc  basefilename_0001_-24.0.mrc
mv basefilename_0019_27.0.mrc   basefilename_0001_27.0.mrc
mv basefilename_0020_-27.0.mrc  basefilename_0001_-27.0.mrc
mv basefilename_0021_30.0.mrc   basefilename_0001_30.0.mrc
mv basefilename_0022_-30.0.mrc  basefilename_0001_-30.0.mrc
mv basefilename_0023_33.0.mrc   basefilename_0001_33.0.mrc
mv basefilename_0024_-33.0.mrc  basefilename_0001_-33.0.mrc
mv basefilename_0025_36.0.mrc   basefilename_0001_36.0.mrc
mv basefilename_0026_-36.0.mrc  basefilename_0001_-36.0.mrc
mv basefilename_0027_39.0.mrc   basefilename_0001_39.0.mrc
mv basefilename_0028_-39.0.mrc  basefilename_0001_-39.0.mrc
mv basefilename_0029_42.0.mrc   basefilename_0001_42.0.mrc
mv basefilename_0030_-42.0.mrc  basefilename_0001_-42.0.mrc
mv basefilename_0031_45.0.mrc   basefilename_0001_45.0.mrc
mv basefilename_0032_-45.0.mrc  basefilename_0001_-45.0.mrc
mv basefilename_0033_48.0.mrc   basefilename_0001_48.0.mrc
mv basefilename_0034_-48.0.mrc  basefilename_0001_-48.0.mrc
mv basefilename_0035_51.0.mrc   basefilename_0001_51.0.mrc
mv basefilename_0036_-51.0.mrc  basefilename_0001_-51.0.mrc
mv basefilename_0037_54.0.mrc   basefilename_0001_54.0.mrc
mv basefilename_0038_-54.0.mrc  basefilename_0001_-54.0.mrc
mv basefilename_0039_57.0.mrc   basefilename_0001_57.0.mrc
mv basefilename_0040_-57.0.mrc  basefilename_0001_-57.0.mrc
mv basefilename_0041_60.0.mrc   basefilename_0001_60.0.mrc
mv basefilename_0042_-60.0.mrc  basefilename_0001_-60.0.mrc
mv basefilename_0043_0.0.mrc    basefilename_0002_0.0.mrc
mv basefilename_0044_-0.0.mrc   basefilename_0002_-0.0.mrc
mv basefilename_0045_3.0.mrc    basefilename_0002_3.0.mrc
mv basefilename_0046_-3.0.mrc   basefilename_0002_-3.0.mrc
mv basefilename_0047_6.0.mrc    basefilename_0002_6.0.mrc
mv basefilename_0048_-6.0.mrc   basefilename_0002_-6.0.mrc
mv basefilename_0049_9.0.mrc    basefilename_0002_9.0.mrc
mv basefilename_0050_-9.0.mrc   basefilename_0002_-9.0.mrc
mv basefilename_0051_12.0.mrc   basefilename_0002_12.0.mrc
mv basefilename_0052_-12.0.mrc  basefilename_0002_-12.0.mrc
mv basefilename_0053_15.0.mrc   basefilename_0002_15.0.mrc
mv basefilename_0054_-15.0.mrc  basefilename_0002_-15.0.mrc
mv basefilename_0055_18.0.mrc   basefilename_0002_18.0.mrc
mv basefilename_0056_-18.0.mrc  basefilename_0002_-18.0.mrc
mv basefilename_0057_21.0.mrc   basefilename_0002_21.0.mrc
mv basefilename_0058_-21.0.mrc  basefilename_0002_-21.0.mrc
mv basefilename_0059_24.0.mrc   basefilename_0002_24.0.mrc
mv basefilename_0060_-24.0.mrc  basefilename_0002_-24.0.mrc
mv basefilename_0061_27.0.mrc   basefilename_0002_27.0.mrc
mv basefilename_0062_-27.0.mrc  basefilename_0002_-27.0.mrc
mv basefilename_0063_30.0.mrc   basefilename_0002_30.0.mrc
mv basefilename_0064_-30.0.mrc  basefilename_0002_-30.0.mrc
mv basefilename_0065_33.0.mrc   basefilename_0002_33.0.mrc
mv basefilename_0066_-33.0.mrc  basefilename_0002_-33.0.mrc
mv basefilename_0067_36.0.mrc   basefilename_0002_36.0.mrc
mv basefilename_0068_-36.0.mrc  basefilename_0002_-36.0.mrc
mv basefilename_0069_39.0.mrc   basefilename_0002_39.0.mrc
mv basefilename_0070_-39.0.mrc  basefilename_0002_-39.0.mrc
mv basefilename_0071_42.0.mrc   basefilename_0002_42.0.mrc
mv basefilename_0072_-42.0.mrc  basefilename_0002_-42.0.mrc
mv basefilename_0073_45.0.mrc   basefilename_0002_45.0.mrc
mv basefilename_0074_-45.0.mrc  basefilename_0002_-45.0.mrc
mv basefilename_0075_48.0.mrc   basefilename_0002_48.0.mrc
mv basefilename_0076_-48.0.mrc  basefilename_0002_-48.0.mrc
mv basefilename_0077_51.0.mrc   basefilename_0002_51.0.mrc
mv basefilename_0078_-51.0.mrc  basefilename_0002_-51.0.mrc
mv basefilename_0079_54.0.mrc   basefilename_0002_54.0.mrc
mv basefilename_0080_-54.0.mrc  basefilename_0002_-54.0.mrc
mv basefilename_0081_57.0.mrc   basefilename_0002_57.0.mrc
mv basefilename_0082_-57.0.mrc  basefilename_0002_-57.0.mrc
mv basefilename_0083_60.0.mrc   basefilename_0002_60.0.mrc
mv basefilename_0084_-60.0.mrc  basefilename_0003_-60.0.mrc
mv basefilename_0085_0.0.mrc    basefilename_0003_0.0.mrc
mv basefilename_0086_-0.0.mrc   basefilename_0003_-0.0.mrc
mv basefilename_0087_3.0.mrc    basefilename_0003_3.0.mrc
mv basefilename_0088_-3.0.mrc   basefilename_0003_-3.0.mrc
mv basefilename_0089_6.0.mrc    basefilename_0003_6.0.mrc
mv basefilename_0090_-6.0.mrc   basefilename_0003_-6.0.mrc
mv basefilename_0091_9.0.mrc    basefilename_0003_9.0.mrc
mv basefilename_0092_-9.0.mrc   basefilename_0003_-9.0.mrc
mv basefilename_0093_12.0.mrc   basefilename_0003_12.0.mrc
mv basefilename_0094_-12.0.mrc  basefilename_0003_-12.0.mrc
mv basefilename_0095_15.0.mrc   basefilename_0003_15.0.mrc
mv basefilename_0096_-15.0.mrc  basefilename_0003_-15.0.mrc
mv basefilename_0097_18.0.mrc   basefilename_0003_18.0.mrc
mv basefilename_0098_-18.0.mrc  basefilename_0003_-18.0.mrc
mv basefilename_0099_21.0.mrc   basefilename_0003_21.0.mrc
mv basefilename_0100_-21.0.mrc  basefilename_0003_-21.0.mrc

查看一下,如果您还有其他问题,请告诉我。

【讨论】:

  • 这正是我所需要的!在实际重命名之前,该测试也非常有用。
  • 是的,这总是一个很大的优势。始终编​​写一个仅输出脚本将执行的操作的测试用例(在实际执行之前)。这将允许您验证它是否符合您的预期 - 并且在您需要调整某些内容时为您省去无休止的悲伤......祝您的脚本好运。
  • 我在回顾剧本。我试图更好地理解脚本。这部分脚本做了什么:%-28s %s\n 。另外,假设我不想替换“-*_”中的数字,而只想将 0001 放在 .mrc 文件扩展名之前的 0002,0003 数字之前。我将如何修改脚本?
  • 这只是一个 field-width 修饰符,'-' 使字段 left-justified。因此,由于文件名是 ~28-chars,我希望它们都是 28-chars(并且左对齐),因此右列的格式很好。 %-28s %s\n 只是示例输出的完整 printf 格式字符串。除了输出将要执行的操作之外,它对脚本没有任何其他影响。有关完整详细信息,请参阅系统上的 man 3 printf
  • 如果你想要basefilename_-0.0_0001.mrc而不是basefilename_0001_-0.0.mrc,这是可行的,但需要几个额外的步骤,因为它不再是一个替换,而是一个完整的重新格式化名称部分。
【解决方案2】:

我确信有比以下 shell 脚本更好的方法来获取您想要的内容,但假设文件按需要排序,则类似以下内容应该可以工作:

#!/bin/bash

set -e

count=1
index=1

for p in *.mrc; do
    if expr $count == 42 > /dev/null; then
        index=`expr $index + 1`
        count=1
    else
        count=`expr $count + 1`
    fi

    mv $p `echo $p | sed -e "s/\(.*_\)\([0-9]*\)\(_.*\)/\1000${index}\3/"`
done

上面的sed 命令将文件名$p 分成三部分,在转义括号对\( ... \) 之间找到:

  • 基本文件名(例如foo_),保留在\1中,
  • 您正在修改的中心数字,替换为000${index},其中${index} 扩展为1 用于第一组41 个文件,2 用于第二组等,并且
  • 后缀(例如_3.0.mrc),保留在\3

这不是一个非常健壮的实现,因为如果$index 大于9,您最终可能会得到像00023 这样的中心数字,但我认为您对自己的实现有所了解。

除了sed,您还可以使用内置的 Bash 字符串操作。请参阅Advanced Bash-Scripting Guide 中的Section 10.1 Manipulating Strings

【讨论】:

  • 您的想法是正确的,您可以从我的回答中获取printf -v 技巧,以消除可能的00023 问题。最后一点是尽可能避免在 bash 中使用循环调用实用程序(例如 sed)。如果有 1000 个文件,那就是对 sed 的 1000 次调用(以及 1000 个单独的子 shell)。这将使您的脚本更加高效。
  • @DavidC.Rankin 非常好。你的方法比我的好得多。我也学到了一些新东西。谢谢! +1
猜你喜欢
  • 2017-04-10
  • 1970-01-01
  • 2020-01-16
  • 2020-10-20
  • 2015-01-24
  • 2014-04-22
  • 1970-01-01
  • 2017-01-01
  • 2015-10-10
相关资源
最近更新 更多