当前位置：文江博客话题详情

python中的优雅解决方案以提取数据并将其放置在基本阵列格式下

发布于 2025-02-06 15:39:29 字数 1448 浏览 1 评论 0原文

原文

分享到QQ

分享到微博

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

需要登录才能够评论，你可以免费注册一个本站的账号。

君勿笑 2025-02-13 15:39:29

很难说以下内容是优雅的还是漂亮的，但我认为这有点“ Pythonic”。我们可以使用以下函数来解析Wolfram输出，该函数将其作为输入输入到该文件的打开文件指针：

def parse_wolfram(file_pointer):
    # the first line is the header, which we ignore
    _ = file_pointer.readline()
    row_str = str()
    out_data = []
    while True:
        # Read each line till EOF stripping leading and trailing white spaces
        line = file_pointer.readline().strip()
        if not line:
            break

        # Append each line as a string to the current row
        row_str += line
        # Find '}' to detect the end of a row
        if line.find('}') > 0:
            # Parse the row:
            # 1. Use the regular expression module to split the string
            #    where the delimiter is one or more of the character set.
            #    This produces a list of string tokens.
            # 2. [1:-1] removes the empty string tokens at the head and 
            #    tail of this list
            # 3. Use list comprehension to cast string tokens to float.
            # 4. Append list of floats for each row to output list of lists (2-D array)  
            out_data.append([float(data) for data in re.split(r'[{, }]+', row_str)[1:-1]])
            # Reset for next row
            row_str = str()

    return out_data

如果该文件格式化为OP建议：该函数可以在名为'chain..m'的文件上使用。

    with open('chain.m', 'r', encoding='utf-8') as fp:
        parsed_output = parse_wolfram(fp)
        
    print(parsed_output)
    [[0.29344728841663786, 0.00037262711145454893, 0.7061800844719075, 67.41431300170986, 1.3887122472912174, 0.0014182932914303275, 500.97644711373647, 0.0002565333937360516, 105.86185844804378], [0.29479428399557506, 0.0007813301223490133, 0.7044243858820759, 67.40475060370453, 1.3779372193629575, 6.103376259459755e-05, 500.30876628350757, 1.106337484454747e-05, 101.39952463245301]]

该输出是浮子列表的python列表。可以使用numpy.Array（parsed_output）将其转换为numpy数组。

Hard to say if the following is elegant or pretty, but I believe that it is somewhat 'pythonic'. We can parse the Wolfram output as specified using the following function that takes as input an opened file pointer to the file:

def parse_wolfram(file_pointer):
    # the first line is the header, which we ignore
    _ = file_pointer.readline()
    row_str = str()
    out_data = []
    while True:
        # Read each line till EOF stripping leading and trailing white spaces
        line = file_pointer.readline().strip()
        if not line:
            break

        # Append each line as a string to the current row
        row_str += line
        # Find '}' to detect the end of a row
        if line.find('}') > 0:
            # Parse the row:
            # 1. Use the regular expression module to split the string
            #    where the delimiter is one or more of the character set.
            #    This produces a list of string tokens.
            # 2. [1:-1] removes the empty string tokens at the head and 
            #    tail of this list
            # 3. Use list comprehension to cast string tokens to float.
            # 4. Append list of floats for each row to output list of lists (2-D array)  
            out_data.append([float(data) for data in re.split(r'[{, }]+', row_str)[1:-1]])
            # Reset for next row
            row_str = str()

    return out_data

This function can be used as such on the file named 'chain.m' if that file is formatted as the OP suggests:

    with open('chain.m', 'r', encoding='utf-8') as fp:
        parsed_output = parse_wolfram(fp)
        
    print(parsed_output)
    [[0.29344728841663786, 0.00037262711145454893, 0.7061800844719075, 67.41431300170986, 1.3887122472912174, 0.0014182932914303275, 500.97644711373647, 0.0002565333937360516, 105.86185844804378], [0.29479428399557506, 0.0007813301223490133, 0.7044243858820759, 67.40475060370453, 1.3779372193629575, 6.103376259459755e-05, 500.30876628350757, 1.106337484454747e-05, 101.39952463245301]]

This output is a python list of lists of floats. This can be converted to a numpy array using numpy.array(parsed_output).

回复收藏 0 原文

~没有更多了~

关于作者

辞旧

暂无简介

文章

26 人气

关注发私信

十二

文章 0 评论 0

关注

飞烟轻若梦

文章 0 评论 0

关注

OPleyuhuo

文章 0 评论 0

关注

wxb0109

文章 0 评论 0

关注

旧城空念

文章 0 评论 0

关注

-小熊_

文章 0 评论 0

友情链接

文江博客

python中的优雅解决方案以提取数据并将其放置在基本阵列格式下

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（1）

关于作者

相关话题

热门标签

推荐作者

十二

飞烟轻若梦

OPleyuhuo

wxb0109

旧城空念

-小熊_

友情链接

python中的优雅解决方案以提取数据并将其放置在基本阵列格式下

如果你对这篇内容有疑问，欢迎到本站社区发帖提问 参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。

发布评论

评论（1）

关于作者

相关话题

热门标签

推荐作者

十二

飞烟轻若梦

OPleyuhuo

wxb0109

旧城空念

-小熊_

友情链接

如果你对这篇内容有疑问，欢迎到本站社区发帖提问参与讨论，获取更多帮助，或者扫码二维码加入 Web 技术交流群。