|
> From: Brian Piotrowski > > Let me give you an example - we have a physical file that contains over > 100,000 records. That's not a lot of records, really. Using native I/O, you can pretty easily read 10,000 records a second; probably more with blocking. > We have a program that searches from a "FROM" date and > reads to either the end of the file or the "TO" date, whichever comes > first. The program that does the READ statement uses a logical file. > The SQL statement uses a BETWEEN parameter to only select those records > that match the indicated criteria. Yes, but what does it DO with the data? Are you updating each record? Writing to another file? Or are you summing a field and getting a total? What you wish to do very much dictates how you should do it. > >From what understand, there would be a disk I/O for every record it > needs to read between the FROM and TO dates. However, with an SQL > SELECT statement, I would only encounter a single disk I/O. Again, what would you be SELECTing? If you're selecting 10,000 records into a multiply occurring data structure, that's different than selecting the SUM of some field in those 10,000 records. But in either case, the system still needs to go out and get those 10,000 records from disk. Either it gets them one at a time or gets them in a block. When a block is read, the more records that can be used from that block the better. So, if your data on disk is in the same order as the data you are reading, sequential reads will go much faster, because fewer physical reads need to occur. But I've never heard of one read getting all 10,000 records. > Is this correct? In the case of this above program, which is the most > efficient way of getting the data? And one more time, the answer depends on what data you are getting. Although I've written some seriously technical benchmarks, I always tell people that you need to determine the business goal before you can use a benchmark. So tell us what you are trying to do with those records. Joe P.S. I will be rerunning my massive benchmark tests later this year when the new V5R3 box comes in. We can get some release-to-release comparisons. But on V5R1, for a simple CHAIN operation, native I/O ran 14000 records per second, while a single-record SELECT INTO ran 1400 records per second and a FETCH on a cursor ran about 950 records per second. I don't have statistics for block reads.
As an Amazon Associate we earn from qualifying purchases.
This mailing list archive is Copyright 1997-2025 by midrange.com and David Gibbs as a compilation work. Use of the archive is restricted to research of a business or technical nature. Any other uses are prohibited. Full details are available on our policy page. If you have questions about this, please contact [javascript protected email address].
Operating expenses for this site are earned using the Amazon Associate program and Google Adsense.