× The internal search function is temporarily non-functional. The current search engine is no longer viable and we are researching alternatives.
As a stop gap measure, we are using Google's custom search engine service.
If you know of an easy to use, open source, search engine ... please contact support@midrange.com.



Hi Mihael,

I agree with Mark.

I don't understand why you'd allocate memory chunk-by-chunk when the total size of the stream file is easily determined beforehand? Are you translating it to/from a format like UTF-8 where the size might not match?

The way I understand it... if you call %realloc() the system will try to extend your existing allocation if it can. I think, in most cases, it will... but it's not guaranteed. Depends on the size you requested, and what other data is stored nearby. If it can't extend the allocation, it moves the data to a new allocation and returns the new pointer -- which might be a performance issue if you're doing it frequently on large allocations.

So, if you can easily get the size of the stream file beforehand (with stat(), as Mark suggests), why not allocate it all at once?

The only thing I can think of... you're doing CCSID translation, so the buffer in your program might not match. In that case, I'd try to use the file size as a guide, but be ready to extend it if necessary.

mmap() is a tool that I've tried to use in previous projects, but failed. Now I can't remember the details -- something to do with limitations of what you can/can't do with mmap(). It might've simply been a peculiarity of the project I was working on! Let us know if it helps you...?


On 1/7/2011 8:17 AM, Mark S. Waterbury wrote:
Mihael:

See embedded remarks below.

Mark

> On 1/7/2011 7:21 AM, Schmidt, Mihael wrote:
Hi,

I am loading some data from a streamfile and I have no clue how big things can get.
The "stat" API will return the size of the streamfile. (See "Unix
type APIs in the InfoCenter.)
Strategy 1: I am allocating some memory (%alloc) and for every line extend the allocated memory (%realloc).
Strategy 2: I get one "big" block of memory and only do a reallocation if the data doesn't fit. (more work on my side)

Doesn't the system do strategy 2 under the hood already? If I allocate 100 bytes the system probably allocates a page (or so) and not only 100 bytes. So a realloc should not trigger another allocation of a whole page.?!
You could easily write some small test programs to empirically determine
the "allocation stragegy" e.g. by reporting the addresses returned.

It is usually better to write your own memory management layer that
allocates large blocks from the OS, and then sub-allocates from within
these large blocks, using whatever custom allocation strategy is deemed
best for the job at hand.
Please enlighten me.
Instead of reading the entire streamfile into memory, use "mmap" (or
"mmap64") API to directly map the streamfile into teraspace. This allows
you to directly address the data in the file as a large array in memory,
and it is paged in on demand.
Thanx in advance.

Mihael Schmidt



As an Amazon Associate we earn from qualifying purchases.

This thread ...

Follow-Ups:
Replies:

Follow On AppleNews
Return to Archive home page | Return to MIDRANGE.COM home page

This mailing list archive is Copyright 1997-2025 by midrange.com and David Gibbs as a compilation work. Use of the archive is restricted to research of a business or technical nature. Any other uses are prohibited. Full details are available on our policy page. If you have questions about this, please contact [javascript protected email address].

Operating expenses for this site are earned using the Amazon Associate program and Google Adsense.