Python's DataFrame implements excel merged cells

Home

Backend Development

Python Tutorial

Python's DataFrame implements excel merged cells_python

不言

Apr 02, 2018 pm 04:19 PM

dataframeexcelpython

This article mainly introduces the DataFrame in python to implement excel merging cells in detail. It has a certain reference value. Interested friends can refer to it.

I often encounter the need to merge cells at work. The data is output to excel, and some of the cells need to be merged. For example, in the table below, the corresponding cells in columns B and C need to be merged based on the value of column A

## The to_excel method in #pandas can only merge indexes, while in xlsxwriter, although the merge_range method is provided, it is only a basic method, and each time you need to write tedious tests to finally adjust it, and it is not very good. Reuse. So I want to write a method myself, combining dataframe and merge_range. The general idea is:

1. Define a MY_DataFrame class and inherit the DataFrame class. This can make good use of many features of pandas without having to reorganize the data structure yourself.

2. Define a my_mergewr_excel method. The parameters are: the path to output excel, the key_cols list used to determine whether it needs to be merged, and the list used to indicate which columns of cells need to be merged.
3. Add MY_DataFrame Encapsulated as a My_Module module for reuse.

The merging algorithm is as follows:

1. According to the [key column] of the given parameters, perform group counting and sorting, and add two auxiliary columns CN and RN

2 , if it is judged that CN is greater than 1, the group needs to be merged, otherwise the group (row) does not need to be merged (CN=1 means that the data row of this group is unique and does not need to be merged)
3. Corresponding to the group that needs to be merged, judge the current column Whether it is in the given parameter [Merge Column], if so, use merge to write excel cells, otherwise, just write excel cells normally.
4. In the column that needs to be merged, if RN=1, call merge_range and write CN cells at once. If RN>1, skip the cell because RN=1 At that time, the cell has been merged and written. If erge_range is called repeatedly, an error will be reported when opening the excel document.

The explanation with pictures is as follows:

The specific code is as follows:

# -*- coding: utf-8 -*- 
""" 
Created on 20170301 
 
@author: ARK-Z 
""" 
import xlsxwriter 
 
 
import pandas as pd 
 
class My_DataFrame(pd.DataFrame): 
  def __init__(self, data=None, index=None, columns=None, dtype=None, copy=False): 
    pd.DataFrame.__init__(self, data, index, columns, dtype, copy) 
 
  def my_mergewr_excel(self,path,key_cols=[],merge_cols=[]): 
    # sheet_name=&#39;Sheet1&#39;, na_rep=&#39;&#39;, float_format=None, columns=None, header=True, index=True, index_label=None, startrow=0, startcol=0, engine=None, merge_cells=True, encoding=None, inf_rep=&#39;inf&#39;, verbose=True): 
    self_copy=My_DataFrame(self,copy=True) 
    line_cn=self_copy.index.size 
    cols=list(self_copy.columns.values) 
    if all([v in cols for i,v in enumerate(key_cols)])==False:   #校验key_cols中各元素 是否都包含与对象的列 
      print("key_cols is not completely include object&#39;s columns") 
      return False 
    if all([v in cols for i,v in enumerate(merge_cols)])==False: #校验merge_cols中各元素 是否都包含与对象的列 
      print("merge_cols is not completely include object&#39;s columns") 
      return False   
 
    wb2007 = xlsxwriter.Workbook(path) 
    worksheet2007 = wb2007.add_worksheet() 
    format_top = wb2007.add_format({&#39;border&#39;:1,&#39;bold&#39;:True,&#39;text_wrap&#39;:True}) 
    format_other = wb2007.add_format({&#39;border&#39;:1,&#39;valign&#39;:&#39;vcenter&#39;}) 
    for i,value in enumerate(cols): #写表头 
      #print(value) 
      worksheet2007.write(0,i,value,format_top) 
     
    #merge_cols=[&#39;B&#39;,&#39;A&#39;,&#39;C&#39;] 
    #key_cols=[&#39;A&#39;,&#39;B&#39;] 
    if key_cols ==[]:  #如果key_cols 参数不传值，则无需合并 
      self_copy[&#39;RN&#39;]=1 
      self_copy[&#39;CN&#39;]=1 
    else: 
      self_copy[&#39;RN&#39;]=self_copy.groupby(key_cols,as_index=False).rank(method=&#39;first&#39;).ix[:,0] #以key_cols作为是否合并的依据 
      self_copy[&#39;CN&#39;]=self_copy.groupby(key_cols,as_index=False).rank(method=&#39;max&#39;).ix[:,0] 
    #print(self) 
    for i in range(line_cn): 
      if self_copy.ix[i,&#39;CN&#39;]>1: 
        #print(&#39;该行有需要合并的单元格&#39;) 
        for j,col in enumerate(cols): 
          #print(self_copy.ix[i,col]) 
          if col in (merge_cols):  #哪些列需要合并 
            if self_copy.ix[i,&#39;RN&#39;]==1: #合并写第一个单元格，下一个第一个将不再写 
              worksheet2007.merge_range(i+1,j,i+int(self_copy.ix[i,&#39;CN&#39;]),j, self_copy.ix[i,col],format_other) ##合并单元格，根据LINE_SET[7]判断需要合并几个 
              #worksheet2007.write(i+1,j,df.ix[i,col]) 
            else: 
              pass 
            #worksheet2007.write(i+1,j,df.ix[i,j]) 
          else: 
            worksheet2007.write(i+1,j,self_copy.ix[i,col],format_other) 
          #print(&#39;,&#39;) 
      else: 
        #print(&#39;该行无需要合并的单元格&#39;) 
        for j,col in enumerate(cols): 
          #print(df.ix[i,col]) 
          worksheet2007.write(i+1,j,self_copy.ix[i,col],format_other) 
         
         
    wb2007.close() 
    self_copy.drop(&#39;CN&#39;, axis=1) 
    self_copy.drop(&#39;RN&#39;, axis=1)

Calling code:

import My_Module 
 
DF=My_DataFrame({&#39;A&#39;:[1,2,2,2,3,3],&#39;B&#39;:[1,1,1,1,1,1],&#39;C&#39;:[1,1,1,1,1,1],&#39;D&#39;:[1,1,1,1,1,1]}) 
 
DF 
Out[120]:  
  A B C D 
0 1 1 1 1 
1 2 1 1 1 
2 2 1 1 1 
3 2 1 1 1 
4 3 1 1 1 
5 3 1 1 1  


DF.my_mergewr_excel(&#39;000_2.xlsx&#39;,[&#39;A&#39;],[&#39;B&#39;,&#39;C&#39;])

The effect is as follows:

You can also set merge A , Column B:

DF.my_mergewr_excel(&#39;000_2.xlsx&#39;,[&#39;A&#39;],[&#39;A&#39;,&#39;B&#39;])

The effect is as follows:

The above is the detailed content of Python's DataFrame implements excel merged cells_python. For more information, please follow other related articles on the PHP Chinese website!

Statement

The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

What are the alternatives to concatenate two lists in Python?May 09, 2025 am 12:16 AM

There are many methods to connect two lists in Python: 1. Use operators, which are simple but inefficient in large lists; 2. Use extend method, which is efficient but will modify the original list; 3. Use the = operator, which is both efficient and readable; 4. Use itertools.chain function, which is memory efficient but requires additional import; 5. Use list parsing, which is elegant but may be too complex. The selection method should be based on the code context and requirements.

Python: Efficient Ways to Merge Two ListsMay 09, 2025 am 12:15 AM

There are many ways to merge Python lists: 1. Use operators, which are simple but not memory efficient for large lists; 2. Use extend method, which is efficient but will modify the original list; 3. Use itertools.chain, which is suitable for large data sets; 4. Use * operator, merge small to medium-sized lists in one line of code; 5. Use numpy.concatenate, which is suitable for large data sets and scenarios with high performance requirements; 6. Use append method, which is suitable for small lists but is inefficient. When selecting a method, you need to consider the list size and application scenarios.

Compiled vs Interpreted Languages: pros and consMay 09, 2025 am 12:06 AM

Compiledlanguagesofferspeedandsecurity,whileinterpretedlanguagesprovideeaseofuseandportability.1)CompiledlanguageslikeC arefasterandsecurebuthavelongerdevelopmentcyclesandplatformdependency.2)InterpretedlanguageslikePythonareeasiertouseandmoreportab

Python: For and While Loops, the most complete guideMay 09, 2025 am 12:05 AM

In Python, a for loop is used to traverse iterable objects, and a while loop is used to perform operations repeatedly when the condition is satisfied. 1) For loop example: traverse the list and print the elements. 2) While loop example: guess the number game until you guess it right. Mastering cycle principles and optimization techniques can improve code efficiency and reliability.

Python concatenate lists into a stringMay 09, 2025 am 12:02 AM

To concatenate a list into a string, using the join() method in Python is the best choice. 1) Use the join() method to concatenate the list elements into a string, such as ''.join(my_list). 2) For a list containing numbers, convert map(str, numbers) into a string before concatenating. 3) You can use generator expressions for complex formatting, such as ','.join(f'({fruit})'forfruitinfruits). 4) When processing mixed data types, use map(str, mixed_list) to ensure that all elements can be converted into strings. 5) For large lists, use ''.join(large_li

Python's Hybrid Approach: Compilation and Interpretation CombinedMay 08, 2025 am 12:16 AM

Pythonusesahybridapproach,combiningcompilationtobytecodeandinterpretation.1)Codeiscompiledtoplatform-independentbytecode.2)BytecodeisinterpretedbythePythonVirtualMachine,enhancingefficiencyandportability.

Learn the Differences Between Python's 'for' and 'while' LoopsMay 08, 2025 am 12:11 AM

ThekeydifferencesbetweenPython's"for"and"while"loopsare:1)"For"loopsareidealforiteratingoversequencesorknowniterations,while2)"while"loopsarebetterforcontinuinguntilaconditionismetwithoutpredefinediterations.Un

Python concatenate lists with duplicatesMay 08, 2025 am 12:09 AM

In Python, you can connect lists and manage duplicate elements through a variety of methods: 1) Use operators or extend() to retain all duplicate elements; 2) Convert to sets and then return to lists to remove all duplicate elements, but the original order will be lost; 3) Use loops or list comprehensions to combine sets to remove duplicate elements and maintain the original order.

See all articles