如何使用LightCells API在Aspose.cells中读写大型Excel文件
我想利用Aspose中的LightCells API。电池以帮助优化我的读写过程,同时还可以在处理Excel文件时降低内存利用率。
我对此API的实现有疑问。如何使用轻型细胞API- LightCellSdatapRovider和LightCellSdatahandler-通过更少的内存利用率有效地读取和写Excel。我特别想将单元的数据加载到内存中的数据(例如,仅处理前100行,然后是下一个100行,依此类推),以便在整个过程中消耗较少的内存。
想要在下面实现代码,但在Aspose中使用LightCell API -
public static void createTemplate( String originalFilePath, List<String> originalFileSheetNameList, String templateFilePath ) throws Exception {
LoadOptions opts = new LoadOptions();
opts.setMemorySetting(MemorySetting.MEMORY_PREFERENCE);
Workbook wb = new Workbook(originalFilePath, opts);
WorksheetCollection wsCollections = wb.getWorksheets();
for(String sheetName: originalFileSheetNameList) {
Worksheet sheet = wsCollections.get(sheetName);
sheet.getCells().deleteRows(1, sheet.getCells().getMaxDataRow());
}
FileOutputStream outputStream = new FileOutputStream(templateFilePath);
wb.save(outputStream, SaveFormat.XLSX);
outputStream.close();
}
代码是Aspose.cells Light Cell API实现。 使用LightCells我如何实现上述代码 -
public static void createTemplate(String originalFilePath, List<String> originalFileSheetName, String templateFilePath ) throws Exception {
LoadOptions opts = new LoadOptions();
LightCellsDataHandlerVisitCells v = new LightCellsDataHandlerVisitCells();
opts.setLightCellsDataHandler((LightCellsDataHandler) v);
Workbook wb = new Workbook(originalFilePath, opts);
int sheetCount = wb.getWorksheets().getCount();
v.startSheet(wb.getWorksheets().get("System_Data"));
System.out.println("Total sheets: " + sheetCount + ", cells: " + v.cellCount + ", strings: " + v.stringCount
+ ", formulas: " + v.formulaCount);
}
用于阅读 -
import com.aspose.cells.Cell;
import com.aspose.cells.CellValueType;
import com.aspose.cells.LightCellsDataHandler;
import com.aspose.cells.Row;
import com.aspose.cells.Worksheet;
public class LightCellsDataHandlerVisitCells implements LightCellsDataHandler {
public int cellCount;
public int formulaCount;
public int stringCount;
public LightCellsDataHandlerVisitCells() {
this.cellCount = 0;
this.formulaCount = 0;
this.stringCount = 0;
}
public int cellCount() {
return cellCount;
}
public int formulaCount() {
return formulaCount;
}
public int stringCount() {
return stringCount;
}
public boolean startSheet(Worksheet sheet) {
System.out.println("Processing sheet[" + sheet.getName() + "]");
System.out.println("Processing sheet rows[" + sheet.getCells().getMaxDataRow() + "]");
return true;
}
public boolean startRow(int rowIndex) {
return true;
}
public boolean processRow(Row row) {
return true;
}
public boolean startCell(int column) {
return true;
}
public boolean processCell(Cell cell) {
this.cellCount = this.cellCount + 1;
cell.getRow();
if (cell.isFormula()) {
this.formulaCount = this.formulaCount + 1;
} else if (cell.getType() == CellValueType.IS_STRING) {
this.stringCount = this.stringCount + 1;
}
return false;
}
}
在system.out.println上print -1 print -1(“处理表行[ +) Sheet.GetCells()。getMaxDatarow() +“]”);
用于写作 -
public class LightCellsDataProviderDemo implements LightCellsDataProvider {
private final int sheetCount;
private final int maxRowIndex;
private final int maxColIndex;
private int rowIndex;
private int colIndex;
private final Style style1;
private final Style style2;
public LightCellsDataProviderDemo(Workbook wb, int sheetCount, int rowCount, int colCount) {
// set the variables/objects
this.sheetCount = sheetCount;
this.maxRowIndex = rowCount - 1;
this.maxColIndex = colCount - 1;
// add new style object with specific formattings
style1 = wb.createStyle();
Font font = style1.getFont();
font.setName("MS Sans Serif");
font.setSize(10);
font.setBold(true);
font.setItalic(true);
font.setUnderline(FontUnderlineType.SINGLE);
font.setColor(Color.fromArgb(0xffff0000));
style1.setHorizontalAlignment(TextAlignmentType.CENTER);
// create another style
style2 = wb.createStyle();
style2.setCustom("#,##0.00");
font = style2.getFont();
font.setName("Copperplate Gothic Bold");
font.setSize(8);
style2.setPattern(BackgroundType.SOLID);
style2.setForegroundColor(Color.fromArgb(0xff0000ff));
style2.setBorder(BorderType.TOP_BORDER, CellBorderType.THICK, Color.getBlack());
style2.setVerticalAlignment(TextAlignmentType.CENTER);
}
public boolean isGatherString() {
return false;
}
public int nextCell() {
if (colIndex < maxColIndex) {
colIndex++;
return colIndex;
}
return -1;
}
public int nextRow() {
if (rowIndex < maxRowIndex) {
rowIndex++;
colIndex = -1; // reset column index
if (rowIndex % 1000 == 0) {
System.out.println("Row " + rowIndex);
}
return rowIndex;
}
return -1;
}
public void startCell(Cell cell) {
if (rowIndex % 50 == 0 && (colIndex == 0 || colIndex == 3)) {
// do not change the content of hyperlink.
return;
}
if (colIndex < 10) {
cell.putValue("test_" + rowIndex + "_" + colIndex);
cell.setStyle(style1);
} else {
if (colIndex == 19) {
cell.setFormula("=Rand() + test!L1");
} else {
cell.putValue(rowIndex * colIndex);
}
cell.setStyle(style2);
}
}
public void startRow(Row row) {
row.setHeight(25);
}
public boolean startSheet(int sheetIndex) {
if (sheetIndex < sheetCount) {
// reset row/column index
rowIndex = -1;
colIndex = -1;
return true;
}
return false;
}`
source - https://forum.aspose.com/t/read-write-com/t/read-write-巨大的excel-files-1m行/38441/8 https://docs.aspose.com/cell.com/cells/cells/java/java/java/ -api/
I want to make use of the LightCells API within Aspose.Cells to help optimize my read and write processes while also reducing the memory utilization while processing the excel file.
I have queries regarding the implementation of this API. How can I use light cell APIs - LightCellsDataProvider and LightCellsDataHandler - to efficiently read and write Excel through less memory utilization. I specifically want to load the data of the cells in memory in chunks (for e.g. processing only the first 100 rows and then the next 100 and so on) so that less memory will be consumed in the entire process.
Want to implement below code but using lightCell api in aspose-
public static void createTemplate( String originalFilePath, List<String> originalFileSheetNameList, String templateFilePath ) throws Exception {
LoadOptions opts = new LoadOptions();
opts.setMemorySetting(MemorySetting.MEMORY_PREFERENCE);
Workbook wb = new Workbook(originalFilePath, opts);
WorksheetCollection wsCollections = wb.getWorksheets();
for(String sheetName: originalFileSheetNameList) {
Worksheet sheet = wsCollections.get(sheetName);
sheet.getCells().deleteRows(1, sheet.getCells().getMaxDataRow());
}
FileOutputStream outputStream = new FileOutputStream(templateFilePath);
wb.save(outputStream, SaveFormat.XLSX);
outputStream.close();
}
Below code is the Aspose.Cells light cell api implementation.
Using LightCells how can I implement above code -
public static void createTemplate(String originalFilePath, List<String> originalFileSheetName, String templateFilePath ) throws Exception {
LoadOptions opts = new LoadOptions();
LightCellsDataHandlerVisitCells v = new LightCellsDataHandlerVisitCells();
opts.setLightCellsDataHandler((LightCellsDataHandler) v);
Workbook wb = new Workbook(originalFilePath, opts);
int sheetCount = wb.getWorksheets().getCount();
v.startSheet(wb.getWorksheets().get("System_Data"));
System.out.println("Total sheets: " + sheetCount + ", cells: " + v.cellCount + ", strings: " + v.stringCount
+ ", formulas: " + v.formulaCount);
}
For reading -
import com.aspose.cells.Cell;
import com.aspose.cells.CellValueType;
import com.aspose.cells.LightCellsDataHandler;
import com.aspose.cells.Row;
import com.aspose.cells.Worksheet;
public class LightCellsDataHandlerVisitCells implements LightCellsDataHandler {
public int cellCount;
public int formulaCount;
public int stringCount;
public LightCellsDataHandlerVisitCells() {
this.cellCount = 0;
this.formulaCount = 0;
this.stringCount = 0;
}
public int cellCount() {
return cellCount;
}
public int formulaCount() {
return formulaCount;
}
public int stringCount() {
return stringCount;
}
public boolean startSheet(Worksheet sheet) {
System.out.println("Processing sheet[" + sheet.getName() + "]");
System.out.println("Processing sheet rows[" + sheet.getCells().getMaxDataRow() + "]");
return true;
}
public boolean startRow(int rowIndex) {
return true;
}
public boolean processRow(Row row) {
return true;
}
public boolean startCell(int column) {
return true;
}
public boolean processCell(Cell cell) {
this.cellCount = this.cellCount + 1;
cell.getRow();
if (cell.isFormula()) {
this.formulaCount = this.formulaCount + 1;
} else if (cell.getType() == CellValueType.IS_STRING) {
this.stringCount = this.stringCount + 1;
}
return false;
}
}
getting print -1 on System.out.println("Processing sheet rows[" + sheet.getCells().getMaxDataRow() + "]");
For writing -
public class LightCellsDataProviderDemo implements LightCellsDataProvider {
private final int sheetCount;
private final int maxRowIndex;
private final int maxColIndex;
private int rowIndex;
private int colIndex;
private final Style style1;
private final Style style2;
public LightCellsDataProviderDemo(Workbook wb, int sheetCount, int rowCount, int colCount) {
// set the variables/objects
this.sheetCount = sheetCount;
this.maxRowIndex = rowCount - 1;
this.maxColIndex = colCount - 1;
// add new style object with specific formattings
style1 = wb.createStyle();
Font font = style1.getFont();
font.setName("MS Sans Serif");
font.setSize(10);
font.setBold(true);
font.setItalic(true);
font.setUnderline(FontUnderlineType.SINGLE);
font.setColor(Color.fromArgb(0xffff0000));
style1.setHorizontalAlignment(TextAlignmentType.CENTER);
// create another style
style2 = wb.createStyle();
style2.setCustom("#,##0.00");
font = style2.getFont();
font.setName("Copperplate Gothic Bold");
font.setSize(8);
style2.setPattern(BackgroundType.SOLID);
style2.setForegroundColor(Color.fromArgb(0xff0000ff));
style2.setBorder(BorderType.TOP_BORDER, CellBorderType.THICK, Color.getBlack());
style2.setVerticalAlignment(TextAlignmentType.CENTER);
}
public boolean isGatherString() {
return false;
}
public int nextCell() {
if (colIndex < maxColIndex) {
colIndex++;
return colIndex;
}
return -1;
}
public int nextRow() {
if (rowIndex < maxRowIndex) {
rowIndex++;
colIndex = -1; // reset column index
if (rowIndex % 1000 == 0) {
System.out.println("Row " + rowIndex);
}
return rowIndex;
}
return -1;
}
public void startCell(Cell cell) {
if (rowIndex % 50 == 0 && (colIndex == 0 || colIndex == 3)) {
// do not change the content of hyperlink.
return;
}
if (colIndex < 10) {
cell.putValue("test_" + rowIndex + "_" + colIndex);
cell.setStyle(style1);
} else {
if (colIndex == 19) {
cell.setFormula("=Rand() + test!L1");
} else {
cell.putValue(rowIndex * colIndex);
}
cell.setStyle(style2);
}
}
public void startRow(Row row) {
row.setHeight(25);
}
public boolean startSheet(int sheetIndex) {
if (sheetIndex < sheetCount) {
// reset row/column index
rowIndex = -1;
colIndex = -1;
return true;
}
return false;
}`
Source - https://forum.aspose.com/t/read-write-huge-excel-files-1m-rows/38441/8
https://docs.aspose.com/cells/java/using-lightcells-api/
如果你对这篇内容有疑问,欢迎到本站社区发帖提问 参与讨论,获取更多帮助,或者扫码二维码加入 Web 技术交流群。
data:image/s3,"s3://crabby-images/d5906/d59060df4059a6cc364216c4d63ceec29ef7fe66" alt="扫码二维码加入Web技术交流群"
绑定邮箱获取回复消息
由于您还没有绑定你的真实邮箱,如果其他用户或者作者回复了您的评论,将不能在第一时间通知您!
发布评论
评论(1)
您需要首先了解 Light Cell API 的对象模型、内部结构和工作原理。 LightCells API(由Aspose.Cells提供)主要旨在一一操作单元格数据,而不需要将完整的数据模型块(使用Cell集合等)构建到内存中。它以事件驱动模式工作。保存工作簿时,保存时会逐个提供单元格内容,组件直接将其保存到输出文件中。类似地,在读取模板文件时,该组件会解析每个单元格并一一提供它们的值。
简而言之,当使用 light cells API 时,一个 Cell 对象被处理然后被丢弃,Workbook 对象在内存中根本不保存该集合。在处理包含数据的大型 Excel 文件时,该模型肯定会节省大量内存。
请考虑并注意:
根据您的需要,您可以选择 XLSX 文件格式而不是 XLS,因为 XLSX 是一种基于 Open Office XML 的电子表格格式。一般来说,使用LightCells API保存XLSX文件可以比使用普通方式节省50%甚至更多的内存,保存XLS可以节省20-40%左右的内存。
正如我上面提到的,您不需要像在 LightCells API 中那样执行此操作,数据是逐个单元处理的(无需为整个电子表格构建完整的数据模型),这是更注重性能的方法,消耗更少的内存并且工作效率更高。在轻量级模式下,一个单元格被处理后立即丢弃,然后导航到下一个单元格,依此类推。
在 LightCells 模式下,导入附加数据、删除行、列或单元格范围可能不起作用(考虑 LightCells API 的架构和性质),因此您必须在实现之外执行这些操作,即执行删除等操作实现光单元接口的类中的行/列或单元格。
如果您对 light cells API 的使用还有更多疑问、疑问或问题,您也可以在专门的 中发帖论坛。
附言。我在 Aspose 担任支持开发人员/布道者。
You need to understand the object model, internal structure and working of the light cells APIs first. The LightCells APIs (provided by Aspose.Cells) are mainly designed to manipulate cell data one by one without building a complete data model block (using the Cell collection, etc.) into memory. It works in an event-driven mode. To save workbooks, it will provide the cell content cell by cell when saving, and the component saves it to the output file directly. Similarly when reading template files, the component parses every cell and provides their value one by one.
In short, when using light cells APIs, one Cell object is processed and then discarded, the Workbook object does not hold the collection at all in the memory. This model would surely save lots of memory when handling large Excel file with data.
Please consider and note:
For your needs, you may choose XLSX file format instead of XLS, because XLSX is an Open Office XML-based spreadsheet format. Generally, using LightCells API to save XLSX file may save 50% or more memory than using the common way, saving XLS may save about 20-40% memory.
As I mentioned above, you do not need to do this as in LightCells APIs, data is processed cell by cell (without building the complete data model for the whole spreadsheet) which is more performance oriented approach and will consume less memory and works efficiently. In light weight mode, one cell is processed and then discarded immediately and then it navigates to next cell and so on.
In LightCells mode, importing additional data, deleting rows, columns or range of cells may not work (considering the architecture and nature of the LightCells APIs), so you have to do such things outside of the implementation, i.e., perform the operations like removing rows/cols or cells in the class that implements the light cells interface.
Should you have further queries, doubts or issues working with light cells APIs, you may also post in the dedicated forums.
PS. I am working as Support developer/ Evangelist at Aspose.