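I want to build a graph that shows which tags are used as children of which other tags in a given XML document.
I wrote this function to get the set of unique child tags for a given tag in an lxml.etree tree: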
    from itertools import chain


    def iter_unique_child_tags(root, tag):
        """Iterates through unique child tags for all instances of tag.

        Iteration starts at `root`.
        """
        found_child_tags = set()
        # Note: iterdescendants() does not yield `root` itself.
        instances = root.iterdescendants(tag)
        # iterchildren() replaces the deprecated getchildren().
        child_nodes = chain.from_iterable(i.iterchildren() for i in instances)
        child_tags = (n.tag for n in child_nodes)
        for t in child_tags:
            if t not in found_child_tags:
                found_child_tags.add(t)
                yield t
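For a whole-document graph, the same information can be gathered for every tag in a single pass rather than one call per tag. The following is only a rough sketch: it assumes the standard library's `xml.etree.ElementTree` in place of lxml so that it runs without extra dependencies, and `child_tag_map` is a hypothetical name, not part of any library.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict


def child_tag_map(root):
    """Map each tag to the set of tags that appear as its direct children.

    Hypothetical helper for illustration; tags without children get no entry.
    """
    relations = defaultdict(set)
    for parent in root.iter():      # root.iter() includes root itself
        for child in parent:        # iterating an element yields direct children
            relations[parent.tag].add(child.tag)
    return dict(relations)


doc = ET.fromstring("<a><b><c/></b><b><d/></b><c/></a>")
tag_map = child_tag_map(doc)
print(tag_map)
```

Each key/value pair in the resulting mapping corresponds to a set of parent-to-child edges in the graph.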
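Is there a general-purpose graph builder that I could use with this function to build dot files or graphs in some other format?
I also suspect there is a tool explicitly designed for this purpose; what might that be?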
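For reference, the DOT format is simple enough that a tag graph can be written out as plain text without any graph library. The sketch below assumes the relationships are already available as a mapping from parent tag to a set of child tags; `to_dot` and its `name` parameter are hypothetical names chosen for illustration.

```python
def to_dot(relations, name="tags"):
    """Render a {parent_tag: child_tags} mapping as DOT digraph source.

    Hypothetical helper for illustration only.
    """
    lines = ["digraph %s {" % name]
    for parent in sorted(relations):
        for child in sorted(relations[parent]):
            # Quote node names so tags containing ':' or '-' remain valid DOT IDs.
            lines.append('    "%s" -> "%s";' % (parent, child))
    lines.append("}")
    return "\n".join(lines)


dot_source = to_dot({"a": {"b", "c"}, "b": {"d"}})
print(dot_source)
```

Saved to a file such as `tags.dot`, the output can be rendered with Graphviz, for example `dot -Tpng tags.dot -o tags.png`.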
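Solution:
I ended up using python-graph. I also used argparse to build a command-line interface that extracts some basic information from XML documents and builds graph images in the formats supported by pydot. It is called xmlearn, and it is somewhat useful: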
    usage: xmlearn [-h] [-i INFILE] [-p PATH] {graph,dump,tags} ...

    optional arguments:
      -h, --help            show this help message and exit
      -i INFILE, --infile INFILE
                            The XML file to learn about. Defaults to stdin.
      -p PATH, --path PATH  An XPath to be applied to various actions.
                            Defaults to the root node.

    subcommands:
      {graph,dump,tags}
        dump                Dump xml data according to a set of rules.
        tags                Show information about tags.
        graph               Build a graph from the XML tags relationships.
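An interface of this shape can be put together with argparse subparsers. The sketch below is not xmlearn's actual source; it only mirrors the options and subcommands listed in the help text above. The `build_parser` name, the `command` destination, and the use of `None` as the default for both options are assumptions made for illustration.

```python
import argparse


def build_parser():
    """Build a parser resembling the help output above (illustrative only)."""
    parser = argparse.ArgumentParser(prog="xmlearn")
    # Assumption: None stands for "read from stdin".
    parser.add_argument("-i", "--infile", default=None,
                        help="The XML file to learn about. Defaults to stdin.")
    # Assumption: None stands for "use the root node".
    parser.add_argument("-p", "--path", default=None,
                        help="An XPath to be applied to various actions. "
                             "Defaults to the root node.")
    subparsers = parser.add_subparsers(title="subcommands", dest="command")
    subparsers.add_parser("dump",
                          help="Dump xml data according to a set of rules.")
    subparsers.add_parser("tags", help="Show information about tags.")
    subparsers.add_parser("graph",
                          help="Build a graph from the XML tags relationships.")
    return parser


args = build_parser().parse_args(["-p", "//item", "tags"])
print(args.command, args.path, args.infile)
```

Because the global options are defined on the top-level parser, they have to appear before the subcommand name on the command line, which matches the usage line shown above.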