conda 包的 yaml 文件以编程方式更新

yaml file for conda packages update programmatically

我有一个 python 脚本想要读取一组 conda 配方并用一些信息更新它们(例如文件:https://github.com/williamjamir/staged-recipes/blob/850760fb63c0fc000b95ac27680ec018fa94dcb7/recipes/pyexcel-ezodf/meta.yaml):

我正在使用这个:

from ruamel.yaml import YAML

from yaml.constructor import ConstructorError
from yaml.scanner import ScannerError

yaml = YAML(typ='jinja2')
yaml.allow_duplicate_keys = True
with open(file_name) as fp:
     yalm_file = yaml.load(fp)

当我打印原始文件时 yaml_file 使用:

with open(path_file, 'w') as fp:
    yaml.dump(yaml_file, fp, allow_unicode=True, explicit_start=True) 

输出包含大量与类型相关的标签和注释 数据如:

 --- !!python/object/apply:ruamel.yaml.comments.CommentedMap
 dictitems:
 about: !!python/object/apply:ruamel.yaml.comments.CommentedMap
 dictitems: {home: 'https://github.com/soedinglab/xxmotif', license: 
 GPLv3, license_file: LICENSE,
 summary: 'eXhaustive, weight matriX-based motif discovery in nucleotide sequences'}
state:
  _yaml_format: !!python/object/new:ruamel.yaml.comments.Format
    state: !!python/tuple
    - null
    - {_flow_style: false}

我该如何解决这个问题?

YAML 实例的 dump() 方法不采用您提供的参数 (allow_unicode=True, explicit_start=True)。由于您没有提供完整的工作程序,我只能猜测您(也)做了 import ruamel.yaml as yaml(甚至 import yaml)。

由 jinja2 插件完成的转换,处理标准的 jinja2 模板语法(通常在使用 YAML 解析器解析之前处理)需要在加载时完成 和转储时 .因此,您需要使用相同的 YAML(typ='jinja2') 实例来执行此操作:

import sys
file_name = 'meta.yaml'

from ruamel.yaml import YAML

from yaml.constructor import ConstructorError
from yaml.scanner import ScannerError

yaml = YAML(typ='jinja2')
yaml.allow_duplicate_keys = True
yaml.indent(sequence=4, offset=2)
yaml.preserve_quotes = True
# yaml.explicit_start = True
with open(file_name) as fp:
     data = yaml.load(fp)

yaml.dump(data, sys.stdout)

在往返时准确给出您的输入:

{% set name = "pyexcel-ezodf" %}
{% set version = "0.3.3" %}
{% set sha256 = "26ddddc61c6bbe2641a15964ba57eaf92a171478e7ed9efb9ae4db1567d0998
c" %}

package:
  name: {{ name|lower }}
  version: {{ version }}

source:
  fn: {{ name }}-{{ version }}.tar.gz
  # The github url is been used because the tar.gz from pypi is missing the CONT
RIBUTORS.rst file
  url: https://github.com/pyexcel/{{ name }}/archive/v{{ version }}.tar.gz
  sha256: {{ sha256 }}

build:
  noarch: python
  number: 0
  script: python setup.py install --single-version-externally-managed --record r
ecord.txt

requirements:
  build:
    - python
    - setuptools

  run:
    - python
    - lxml

test:
  imports:
    - ezodf


about:
  home: https://github.com/pyexcel/pyexcel-ezodf
  license: MIT
  license_family: MIT
  license_file: '{{ environ["RECIPE_DIR"] }}/LICENSE'
  summary: 'A Python package to create/manipulate OpenDocumentFormat files'
  description: |
    'ezodf is a Python package to create new or open existing' +
    'OpenDocument (ODF) files to extract, add, modify or delete document data' +
    'forked from dead project https://bitbucket.org/mozman/ezodf' +
    'format and to/from databases' +
    ''
  dev_url: https://github.com/pyexcel/pyexcel-ezodf

extra:
  recipe-maintainers:
    - williamjamir

您不需要设置 allow_unicode,这是 YAML.dump() 的默认设置。