ff (Q35166): Difference between revisions
From MaRDI portal
Removed claim: author (P16): Walter Zucchini (Q79481) |
Swh import (talk | contribs) SWHID from Software Heritage |
Property | Value |
---|---|
last update | 25 January 2023 |
copyright license | GNU General Public License, version 2.0 |
copyright license | GNU General Public License, version 3.0 |
copyright license | File License |
depends on software | bit |
depends on software | utils |
depends on software | R |
Property / software version identifier
Version | Rank | Publication date (qualifier) |
---|---|---|
1.0-1 | Normal rank | 3 November 2007 |
2.0.0 | Normal rank | 3 August 2008 |
2.0.1 | Normal rank | 16 April 2009 |
2.1-0 | Normal rank | 3 November 2009 |
2.1-1 | Normal rank | 5 November 2009 |
2.1-2 | Normal rank | 20 January 2010 |
2.2-0 | Normal rank | 30 September 2010 |
2.2-1 | Normal rank | 5 October 2010 |
2.2-2 | Normal rank | 24 April 2011 |
2.2-3 | Normal rank | 25 July 2011 |
2.2-4 | Normal rank | 15 January 2012 |
2.2-5 | Normal rank | 27 January 2012 |
2.2-6 | Normal rank | 31 March 2012 |
2.2-7 | Normal rank | 8 May 2012 |
2.2-9 | Normal rank | 24 October 2012 |
2.2-10 | Normal rank | 6 November 2012 |
2.2-11 | Normal rank | 12 March 2013 |
2.2-12 | Normal rank | 16 October 2013 |
2.2-13 | Normal rank | 9 April 2014 |
2.2-14.2 | Normal rank | 16 April 2020 |
2.2-14 | Normal rank | 15 May 2018 |
4.0.2 | Normal rank | 30 July 2020 |
4.0.4 | Normal rank | 13 October 2020 |
4.0.5 | Normal rank | 29 October 2021 |
4.0.7 | Normal rank | 6 May 2022 |
4.0.12 | Normal rank | 12 January 2024 |
Property / last update
12 January 2024
Normal rank
Property / description
The ff package provides data structures that are stored on disk but behave (almost) as if they were in RAM by transparently mapping only a section (pagesize) into main memory, which is the effective virtual memory consumption per ff object. ff supports R's standard atomic data types 'double', 'logical', 'raw' and 'integer' and the non-standard atomic types boolean (1 bit), quad (2 bit unsigned), nibble (4 bit unsigned), byte (1 byte signed with NAs), ubyte (1 byte unsigned), short (2 byte signed with NAs), ushort (2 byte unsigned) and single (4 byte float with NAs). For example, 'quad' allows efficient storage of genomic data as an 'A','T','G','C' factor. The unsigned types support 'circular' arithmetic. There is also support for the close-to-atomic types 'factor', 'ordered', 'POSIXct', 'Date' and custom close-to-atomic types. ff not only has native C support for vectors, matrices and arrays with flexible dimorder (major column-order, major row-order and generalizations for arrays); there is also an ffdf class not unlike data.frames, and import/export filters for csv files. ff objects store raw data in binary flat files in native encoding, and complement this with metadata stored in R as physical and virtual attributes. ff objects have well-defined hybrid copying semantics, which gives rise to certain performance improvements through virtualization. ff objects can be stored and reopened across R sessions. ff files can be shared by multiple ff R objects (using different data en/de-coding schemes) in the same process or from multiple R processes to exploit parallelism. A wide choice of finalizer options allows working with 'permanent' files as well as creating and removing 'temporary' ff files completely transparently to the user.
On certain OS/filesystem combinations, creating the ff files works without notable delay thanks to sparse file allocation. Several access optimization techniques such as Hybrid Index Preprocessing and Virtualization are implemented to achieve good performance even with large datasets, for example virtual matrix transpose without touching a single byte on disk. Further, to reduce disk I/O, 'logicals' and non-standard data types are stored natively and compactly in binary flat files, i.e. logicals take up exactly 2 bits to represent TRUE, FALSE and NA. Beyond basic access functions, the ff package also provides compatibility functions that facilitate writing code for both ff and ram objects, and support for batch processing on ff objects (e.g. as.ram, as.ff, ffapply). ff interfaces closely with functionality from package 'bit': chunked looping, fast bit operations and coercions between different objects that can store subscript information ('bit', 'bitwhich', ff 'boolean', ri range index, hi hybrid index). This allows working interactively with selections of large datasets and quickly modifying selection criteria. Further high-performance enhancements can be made available upon request.
Normal rank
Property | Value | Rank | Qualifier |
---|---|---|---|
author | Daniel Adler | Normal rank | |
author | Christian Gläser | Normal rank | |
author | Oleg Nenadic | Normal rank | |
author | Jens Oehlschlägel | Normal rank | |
author | Martijn Schuemie | Normal rank | |
author | Walter Zucchini | Normal rank | |
copyright license | GNU General Public License, version 2.0 | Normal rank | |
copyright license | GNU General Public License, version 3.0 | Normal rank | |
copyright license | File License | Normal rank | |
depends on software | bit | Normal rank | software version identifier: ≥ 4.0.0 |
depends on software | utils | Normal rank | |
depends on software | R | Normal rank | software version identifier: ≥ 2.10.1 |
MaRDI profile type | MaRDI software profile | Normal rank | |
source code repository | https://github.com/cran/ff | Normal rank | |
Software Heritage ID | swh:1:snp:03c49c1168b1e275f211d71cf6a9a846c189b8b2 | Normal rank | point in time: 26 January 2024 |
links / mardi / name
Latest revision as of 19:28, 21 March 2024
Memory-Efficient Storage of Large Data on Disk and Fast Access Functions
Language | Label | Description | Also known as |
---|---|---|---|
English | ff | Memory-Efficient Storage of Large Data on Disk and Fast Access Functions | |
Statements
12 January 2024
0 references
The ff package provides data structures that are stored on disk but behave (almost) as if they were in RAM by transparently mapping only a section (pagesize) into main memory, which is the effective virtual memory consumption per ff object. ff supports R's standard atomic data types 'double', 'logical', 'raw' and 'integer' and the non-standard atomic types boolean (1 bit), quad (2 bit unsigned), nibble (4 bit unsigned), byte (1 byte signed with NAs), ubyte (1 byte unsigned), short (2 byte signed with NAs), ushort (2 byte unsigned) and single (4 byte float with NAs). For example, 'quad' allows efficient storage of genomic data as an 'A','T','G','C' factor. The unsigned types support 'circular' arithmetic. There is also support for the close-to-atomic types 'factor', 'ordered', 'POSIXct', 'Date' and custom close-to-atomic types. ff not only has native C support for vectors, matrices and arrays with flexible dimorder (major column-order, major row-order and generalizations for arrays); there is also an ffdf class not unlike data.frames, and import/export filters for csv files. ff objects store raw data in binary flat files in native encoding, and complement this with metadata stored in R as physical and virtual attributes. ff objects have well-defined hybrid copying semantics, which gives rise to certain performance improvements through virtualization. ff objects can be stored and reopened across R sessions. ff files can be shared by multiple ff R objects (using different data en/de-coding schemes) in the same process or from multiple R processes to exploit parallelism. A wide choice of finalizer options allows working with 'permanent' files as well as creating and removing 'temporary' ff files completely transparently to the user.
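The 2-bit 'quad' storage mentioned above can be illustrated with a small language-neutral sketch. The Python below is not ff's C implementation or its R interface; the function names, the level order A, T, G, C and the lowest-bits-first layout inside each byte are assumptions made only for illustration. It shows why four bases fit in one byte when each takes 2 bits.

```python
# Hypothetical illustration of 2-bit packing, the width the description
# gives for 'quad'. Not ff's actual on-disk layout.
BASES = "ATGC"                                   # assumed level order
CODES = {base: code for code, base in enumerate(BASES)}


def pack_quad(seq: str) -> bytes:
    """Pack bases four to a byte, lowest bits first (an arbitrary choice)."""
    out = bytearray((len(seq) + 3) // 4)         # ceil(n / 4) bytes
    for i, base in enumerate(seq):
        out[i // 4] |= CODES[base] << (2 * (i % 4))
    return bytes(out)


def unpack_quad(buf: bytes, n: int) -> str:
    """Recover the first n bases from a packed buffer."""
    return "".join(
        BASES[(buf[i // 4] >> (2 * (i % 4))) & 0b11] for i in range(n)
    )


seq = "GATTACA"
packed = pack_quad(seq)
print(len(seq), "bases stored in", len(packed), "bytes")
print(unpack_quad(packed, len(seq)))
```

Under these assumptions a sequence of n bases needs ceil(n / 4) bytes, compared with 4n bytes if each base were held as a standard R integer code.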
On certain OS/filesystem combinations, creating the ff files works without notable delay thanks to sparse file allocation. Several access optimization techniques such as Hybrid Index Preprocessing and Virtualization are implemented to achieve good performance even with large datasets, for example virtual matrix transpose without touching a single byte on disk. Further, to reduce disk I/O, 'logicals' and non-standard data types are stored natively and compactly in binary flat files, i.e. logicals take up exactly 2 bits to represent TRUE, FALSE and NA. Beyond basic access functions, the ff package also provides compatibility functions that facilitate writing code for both ff and ram objects, and support for batch processing on ff objects (e.g. as.ram, as.ff, ffapply). ff interfaces closely with functionality from package 'bit': chunked looping, fast bit operations and coercions between different objects that can store subscript information ('bit', 'bitwhich', ff 'boolean', ri range index, hi hybrid index). This allows working interactively with selections of large datasets and quickly modifying selection criteria. Further high-performance enhancements can be made available upon request.
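The idea of a virtual matrix transpose that leaves the stored bytes untouched can be sketched as a change of metadata only. The Python below is a conceptual model, not ff's implementation: the `View` class, the `virtual_transpose` function and the 0-based `dimorder` encoding are hypothetical names chosen for this example, and a plain list stands in for the flat file on disk.

```python
from dataclasses import dataclass, replace
from typing import Sequence, Tuple


@dataclass(frozen=True)
class View:
    """A 2-D view on flat storage, described purely by metadata."""
    data: Sequence[float]       # stands in for the binary flat file
    dim: Tuple[int, int]        # (rows, cols) as seen by the user
    dimorder: Tuple[int, int]   # (0, 1): column-major; (1, 0): row-major

    def __getitem__(self, idx: Tuple[int, int]) -> float:
        i, j = idx
        rows, cols = self.dim
        if not (0 <= i < rows and 0 <= j < cols):
            raise IndexError(idx)
        if self.dimorder == (0, 1):
            return self.data[i + j * rows]   # first index varies fastest
        return self.data[i * cols + j]       # second index varies fastest


def virtual_transpose(v: View) -> View:
    """Swap dim and dimorder; the storage object is shared, not copied."""
    return replace(v, dim=v.dim[::-1], dimorder=v.dimorder[::-1])


m = View(data=[1, 2, 3, 4, 5, 6], dim=(2, 3), dimorder=(0, 1))
t = virtual_transpose(m)
print(t.dim, t[2, 1] == m[1, 2], t.data is m.data)
```

Because only two small tuples change, the cost of the transpose in this model is independent of the matrix size; the price is that element access must consult `dimorder` on every lookup.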
0 references
Identifiers
26 January 2024
0 references