【发布时间】:2022-01-16 07:53:18
【问题描述】:
看起来System.Xml.Linq 正在消耗大量内存,即使在任何资源应该被释放之后也是如此。
一个简单的演示
await using ( System.IO.FileStream stream = new ( xmlFilePath, System.IO.FileMode.Open) ) {
using ( System.Xml.XmlReader reader = System.Xml.XmlReader.Create( stream, new () { ConformanceLevel = System.Xml.ConformanceLevel.Fragment, Async = true } ) ) {
int i = 0;
while ( await reader.ReadAsync().ConfigureAwait( false ) ) {
while ( reader.NodeType != System.Xml.XmlNodeType.None ) {
if ( reader.NodeType == System.Xml.XmlNodeType.XmlDeclaration ) {
await reader.SkipAsync().ConfigureAwait( false );
continue;
}
if ( ct.IsCancellationRequested ) {
continue;
}
i++;
if ( i % 100000 == 0 ) {
Console.WriteLine( $"Processed {i}: {reader.ReadString()}" );
}
System.Xml.Linq.XNode node = await System.Xml.Linq.XNode.ReadFromAsync( reader, ct ).ConfigureAwait( false );
}
}
}
}
Console.WriteLine( $"\n---->Memory Use/false: {GC.GetTotalMemory(false):N0}");
Console.WriteLine( $"---->Memory Use : {GC.GetTotalMemory(true):N0}\n");
return;
输出:
---->Memory Use/false: 402,639,448
---->Memory Use : 400,967,152
如果我替换 XNode 部分,
string xmlFilePath = "/home/eric/dev/src/github.com/erichiller/mkmrk-dotnet/src/Cli/dataset/cme/definition/2021/11/2021-11-05/20211104.061134-05_20211104.030927-05_cmeg.nymex.fut.prf.xml";
await using ( System.IO.FileStream stream = new ( xmlFilePath, System.IO.FileMode.Open) ) {
using ( System.Xml.XmlReader reader = System.Xml.XmlReader.Create( stream, new () { ConformanceLevel = System.Xml.ConformanceLevel.Fragment, Async = true } ) ) {
int i = 0;
while ( await reader.ReadAsync().ConfigureAwait( false ) ) {
while ( reader.NodeType != System.Xml.XmlNodeType.None ) {
if ( reader.NodeType == System.Xml.XmlNodeType.XmlDeclaration ) {
await reader.SkipAsync().ConfigureAwait( false );
continue;
}
if ( ct.IsCancellationRequested ) {
continue;
}
i++;
if ( i % 100000 == 0 ) {
Console.WriteLine( $"Processed {i}: {reader.ReadString()}" );
}
await reader.ReadAsync().ConfigureAwait( false );
}
}
}
}
Console.WriteLine( $"\n---->Memory Use/false: {GC.GetTotalMemory(false):N0}");
Console.WriteLine( $"---->Memory Use : {GC.GetTotalMemory(true):N0}\n");
return;
使用量大幅下降:
---->Memory Use/false: 11,048,992
---->Memory Use : 6,317,248
我在这里误解了什么/做错了什么?正在加载的文件很大(~60MB),但即使 XNode 需要使用这么多内存,不应该在到达Console.WriteLine 时释放它吗?
【问题讨论】:
-
不 - 它的不确定性 - .net 是 gc'd,一旦块关闭,事情并不总是从堆中释放
-
出于好奇,你为什么不叫break;而不是继续;取消令牌何时取消?
-
我最终重写为直接使用 XmlReader 而不是通过 System.Xml.Linq ;更好的性能和内存消耗 (~40MB)
标签: c# .net xml linq .net-core