mirrors/git - Incest Forge: Beyond sex. We incest.

mirrors/git

mirror of https://github.com/git/git.git synced 2024-11-05 08:47:56 +01:00

558 lines

12 KiB

C

Raw Normal View History

streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`/*`
			`* Copyright (c) 2011, Google Inc.`
			`*/`
			`#include "cache.h"`
			`#include "streaming.h"`
sha1_file: add repository argument to map_sha1_file Add a repository argument to allow map_sha1_file callers to be more specific about which repository to handle. This is a small mechanical change; it doesn't change the implementation to handle repositories other than the_repository yet. As with the previous commits, use a macro to catch callers passing a repository other than the_repository at compile time. While at it, move the declaration to object-store.h, where it should be easier to find. Signed-off-by: Stefan Beller <sbeller@google.com> Signed-off-by: Jonathan Nieder <jrnieder@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> Signed-off-by: Nguyễn Thái Ngọc Duy <pclouds@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2018-03-23 18:21:14 +01:00			`#include "repository.h"`
			`#include "object-store.h"`
pack: move use_pack() The function open_packed_git() needs to be temporarily made global. Its scope will be restored to static in a subsequent commit. Signed-off-by: Jonathan Tan <jonathantanmy@google.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2017-08-19 00:20:22 +02:00			`#include "packfile.h"`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00
			`enum input_source {`
			`stream_error = -1,`
			`incore = 0,`
			`loose = 1,`
			`pack_non_delta = 2`
			`};`

			`typedef int (open_istream_fn)(struct git_istream ,`
			`struct object_info *,`
streaming: convert istream internals to struct object_id Convert the various open_istream variants to take a pointer to struct object_id. Introduce a temporary, which will be removed later, to work around the fact that lookup_replace_object still returns a pointer to unsigned char. Signed-off-by: brian m. carlson <sandals@crustytoothpaste.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2018-03-12 03:27:50 +01:00			`const struct object_id *,`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`enum object_type *);`
			`typedef int (close_istream_fn)(struct git_istream );`
			`typedef ssize_t (read_istream_fn)(struct git_istream , char *, size_t);`

			`struct stream_vtbl {`
			`close_istream_fn close;`
			`read_istream_fn read;`
			`};`

			`#define open_method_decl(name) \`
			`int open_istream_ ##name \`
			`(struct git_istream st, struct object_info oi, \`
streaming: convert istream internals to struct object_id Convert the various open_istream variants to take a pointer to struct object_id. Introduce a temporary, which will be removed later, to work around the fact that lookup_replace_object still returns a pointer to unsigned char. Signed-off-by: brian m. carlson <sandals@crustytoothpaste.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2018-03-12 03:27:50 +01:00			`const struct object_id *oid, \`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`enum object_type *type)`

			`#define close_method_decl(name) \`
			`int close_istream_ ##name \`
			`(struct git_istream *st)`

			`#define read_method_decl(name) \`
			`ssize_t read_istream_ ##name \`
			`(struct git_istream st, char buf, size_t sz)`

			`/* forward declaration */`
			`static open_method_decl(incore);`
			`static open_method_decl(loose);`
			`static open_method_decl(pack_non_delta);`
Add streaming filter API This introduces an API to plug custom filters to an input stream. The caller gets get_stream_filter("path") to obtain an appropriate filter for the path, and then uses it when opening an input stream via open_istream(). After that, the caller can read from the stream with read_istream(), and close it with close_istream(), just like an unfiltered stream. This only adds a "null" filter that is a pass-thru filter, but later changes can add LF-to-CRLF and other filters, and the callers of the streaming API do not have to change. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-20 23:33:31 +02:00			`static struct git_istream attach_stream_filter(struct git_istream st,`
			`struct stream_filter *filter);`

streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00
			`static open_istream_fn open_istream_tbl[] = {`
			`open_istream_incore,`
			`open_istream_loose,`
			`open_istream_pack_non_delta,`
			`};`

Add streaming filter API This introduces an API to plug custom filters to an input stream. The caller gets get_stream_filter("path") to obtain an appropriate filter for the path, and then uses it when opening an input stream via open_istream(). After that, the caller can read from the stream with read_istream(), and close it with close_istream(), just like an unfiltered stream. This only adds a "null" filter that is a pass-thru filter, but later changes can add LF-to-CRLF and other filters, and the callers of the streaming API do not have to change. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-20 23:33:31 +02:00			`#define FILTER_BUFFER (1024*16)`

			`struct filtered_istream {`
			`struct git_istream *upstream;`
			`struct stream_filter *filter;`
			`char ibuf[FILTER_BUFFER];`
			`char obuf[FILTER_BUFFER];`
			`int i_end, i_ptr;`
			`int o_end, o_ptr;`
stream filter: add "no more input" to the filters Some filters may need to buffer the input and look-ahead inside it to decide what to output, and they may consume more than zero bytes of input and still not produce any output. After feeding all the input, pass NULL as input as keep calling stream_filter() to let such filters know there is no more input coming, and it is time for them to produce the remaining output based on the buffered input. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-21 23:05:51 +02:00			`int input_finished;`
Add streaming filter API This introduces an API to plug custom filters to an input stream. The caller gets get_stream_filter("path") to obtain an appropriate filter for the path, and then uses it when opening an input stream via open_istream(). After that, the caller can read from the stream with read_istream(), and close it with close_istream(), just like an unfiltered stream. This only adds a "null" filter that is a pass-thru filter, but later changes can add LF-to-CRLF and other filters, and the callers of the streaming API do not have to change. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-20 23:33:31 +02:00			`};`

streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`struct git_istream {`
			`const struct stream_vtbl *vtbl;`
			`unsigned long size; /* inflated size of full object */`
Merge branch 'jc/zlib-wrap' * jc/zlib-wrap: zlib: allow feeding more than 4GB in one go zlib: zlib can only process 4GB at a time zlib: wrap deflateBound() too zlib: wrap deflate side of the API zlib: wrap inflateInit2 used to accept only for gzip format zlib: wrap remaining calls to direct inflate/inflateEnd zlib wrapper: refactor error message formatter Conflicts: sha1_file.c 2011-07-19 18:33:03 +02:00			`git_zstream z;`
streaming: read non-delta incrementally from a pack Helped-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-14 00:34:58 +02:00			`enum { z_unused, z_used, z_done, z_error } z_state;`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00
			`union {`
			`struct {`
			`char buf; / from read_object() */`
			`unsigned long read_ptr;`
			`} incore;`

			`struct {`
streaming: read loose objects incrementally Helped-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-15 04:17:10 +02:00			`void *mapped;`
			`unsigned long mapsize;`
			`char hdr[32];`
			`int hdr_avail;`
			`int hdr_used;`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`} loose;`

			`struct {`
streaming: read non-delta incrementally from a pack Helped-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-14 00:34:58 +02:00			`struct packed_git *pack;`
			`off_t pos;`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`} in_pack;`
Add streaming filter API This introduces an API to plug custom filters to an input stream. The caller gets get_stream_filter("path") to obtain an appropriate filter for the path, and then uses it when opening an input stream via open_istream(). After that, the caller can read from the stream with read_istream(), and close it with close_istream(), just like an unfiltered stream. This only adds a "null" filter that is a pass-thru filter, but later changes can add LF-to-CRLF and other filters, and the callers of the streaming API do not have to change. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-20 23:33:31 +02:00
			`struct filtered_istream filtered;`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`} u;`
			`};`

			`int close_istream(struct git_istream *st)`
			`{`
streaming: free git_istream upon closing Kirill Smelkov noticed that post-1.7.6 "git checkout" started leaking tons of memory. The streaming_write_entry function properly calls close_istream(), but that function did not actually free() the allocated git_istream struct. The git_istream struct is totally opaque to calling code, and must be heap-allocated by open_istream. Therefore it's not appropriate for callers to have to free it. This patch makes close_istream() into "close and de-allocate all associated resources". We could add a new "free_istream" call, but there's not much point in letting callers inspect the istream after close. And this patch's semantics make us match fopen/fclose, which is well-known and understood. Signed-off-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-07-22 19:00:03 +02:00			`int r = st->vtbl->close(st);`
			`free(st);`
			`return r;`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`}`

streaming: void pointer instead of char pointer Allow any kind of buffer to be fed to read_istream() without an explicit cast by making it's buf argument a void pointer. It's about arbitrary data, not only characters. Signed-off-by: Rene Scharfe <rene.scharfe@lsrfire.ath.cx> Signed-off-by: Nguyễn Thái Ngọc Duy <pclouds@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2012-05-03 03:51:00 +02:00			`ssize_t read_istream(struct git_istream st, void buf, size_t sz)`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`{`
			`return st->vtbl->read(st, buf, sz);`
			`}`

Convert lookup_replace_object to struct object_id Convert both the argument and the return value to be pointers to struct object_id. Update the callers and their internals to deal with the new type. Remove several temporaries which are no longer needed. Signed-off-by: brian m. carlson <sandals@crustytoothpaste.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2018-03-12 03:27:54 +01:00			`static enum input_source istream_source(const struct object_id *oid,`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`enum object_type *type,`
			`struct object_info *oi)`
			`{`
			`unsigned long size;`
			`int status;`

sha1_object_info_extended: make type calculation optional Each caller of sha1_object_info_extended sets up an object_info struct to tell the function which elements of the object it wants to get. Until now, getting the type of the object has always been required (and it is returned via the return type rather than a pointer in object_info). This can involve actually opening a loose object file to determine its type, or following delta chains to determine a packed file's base type. These effects produce a measurable slow-down when doing a "cat-file --batch-check" that does not include %(objecttype). This patch adds a "typep" query to struct object_info, so that it can be optionally queried just like size and disk_size. As a result, the return type of the function is no longer the object type, but rather 0/-1 for success/error. As there are only three callers total, we just fix up each caller rather than keep a compatibility wrapper: 1. The simpler sha1_object_info wrapper continues to always ask for and return the type field. 2. The istream_source function wants to know the type, and so always asks for it. 3. The cat-file batch code asks for the type only when %(objecttype) is part of the format string. On linux.git, the best-of-five for running: $ git rev-list --objects --all >objects $ time git cat-file --batch-check='%(objectsize:disk)' on a fully packed repository goes from: real 0m8.680s user 0m8.160s sys 0m0.512s to: real 0m7.205s user 0m6.580s sys 0m0.608s Signed-off-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2013-07-12 08:34:57 +02:00			`oi->typep = type;`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`oi->sizep = &size;`
Convert lookup_replace_object to struct object_id Convert both the argument and the return value to be pointers to struct object_id. Update the callers and their internals to deal with the new type. Remove several temporaries which are no longer needed. Signed-off-by: brian m. carlson <sandals@crustytoothpaste.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2018-03-12 03:27:54 +01:00			`status = oid_object_info_extended(oid, oi, 0);`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`if (status < 0)`
			`return stream_error;`

			`switch (oi->whence) {`
			`case OI_LOOSE:`
			`return loose;`
			`case OI_PACKED:`
pack-objects, streaming: turn "xx >= big_file_threshold" to ".. > .." This is because all other places do "xx > big_file_threshold" Signed-off-by: Nguyễn Thái Ngọc Duy <pclouds@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2012-05-16 14:02:09 +02:00			`if (!oi->u.packed.is_delta && big_file_threshold < size)`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`return pack_non_delta;`
			`/* fallthru */`
			`default:`
			`return incore;`
			`}`
			`}`

streaming: convert open_istream to use struct object_id Signed-off-by: brian m. carlson <sandals@crustytoothpaste.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2018-03-12 03:27:40 +01:00			`struct git_istream open_istream(const struct object_id oid,`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`enum object_type *type,`
Add streaming filter API This introduces an API to plug custom filters to an input stream. The caller gets get_stream_filter("path") to obtain an appropriate filter for the path, and then uses it when opening an input stream via open_istream(). After that, the caller can read from the stream with read_istream(), and close it with close_istream(), just like an unfiltered stream. This only adds a "null" filter that is a pass-thru filter, but later changes can add LF-to-CRLF and other filters, and the callers of the streaming API do not have to change. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-20 23:33:31 +02:00			`unsigned long *size,`
			`struct stream_filter *filter)`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`{`
			`struct git_istream *st;`
provide an initializer for "struct object_info" An all-zero initializer is fine for this struct, but because the first element is a pointer, call sites need to know to use "NULL" instead of "0". Otherwise some static checkers like "sparse" will complain; see d099b71 (Fix some sparse warnings, 2013-07-18) for example. So let's provide an initializer to make this easier to get right. But let's also comment that memset() to zero is explicitly OK[1]. One of the callers embeds object_info in another struct which is initialized via memset (expand_data in builtin/cat-file.c). Since our subset of C doesn't allow assignment from a compound literal, handling this in any other way is awkward, so we'd like to keep the ability to initialize by memset(). By documenting this property, it should make anybody who wants to change the initializer think twice before doing so. There's one other caller of interest. In parse_sha1_header(), we did not initialize the struct fully in the first place. This turned out not to be a bug because the sub-function it calls does not look at any other fields except the ones we did initialize. But that assumption might not hold in the future, so it's a dangerous construct. This patch switches it to initializing the whole struct, which protects us against unexpected reads of the other fields. [1] Obviously using memset() to initialize a pointer violates the C standard, but we long ago decided that it was an acceptable tradeoff in the real world. Signed-off-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2016-08-11 11:24:35 +02:00			`struct object_info oi = OBJECT_INFO_INIT;`
Convert lookup_replace_object to struct object_id Convert both the argument and the return value to be pointers to struct object_id. Update the callers and their internals to deal with the new type. Remove several temporaries which are no longer needed. Signed-off-by: brian m. carlson <sandals@crustytoothpaste.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2018-03-12 03:27:54 +01:00			`const struct object_id *real = lookup_replace_object(oid);`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`enum input_source src = istream_source(real, type, &oi);`

			`if (src < 0)`
			`return NULL;`

			`st = xmalloc(sizeof(*st));`
Convert lookup_replace_object to struct object_id Convert both the argument and the return value to be pointers to struct object_id. Update the callers and their internals to deal with the new type. Remove several temporaries which are no longer needed. Signed-off-by: brian m. carlson <sandals@crustytoothpaste.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2018-03-12 03:27:54 +01:00			`if (open_istream_tbl[src](st, &oi, real, type)) {`
			`if (open_istream_incore(st, &oi, real, type)) {`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`free(st);`
			`return NULL;`
			`}`
			`}`
open_istream: remove unneeded check for null pointer 'st' is allocated via xmalloc a few lines before and passed to the stream opening functions. The xmalloc function is written in a way that either 'st' is allocated valid memory or xmalloc already dies. The function calls to open_istream_* do not change 'st', as the pointer is passed by reference and not a pointer of a pointer. Hence 'st' cannot be null at that part of the code. Signed-off-by: Stefan Beller <stefanbeller@googlemail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2013-07-23 15:16:04 +02:00			`if (filter) {`
Add streaming filter API This introduces an API to plug custom filters to an input stream. The caller gets get_stream_filter("path") to obtain an appropriate filter for the path, and then uses it when opening an input stream via open_istream(). After that, the caller can read from the stream with read_istream(), and close it with close_istream(), just like an unfiltered stream. This only adds a "null" filter that is a pass-thru filter, but later changes can add LF-to-CRLF and other filters, and the callers of the streaming API do not have to change. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-20 23:33:31 +02:00			`/* Add "&& !is_null_stream_filter(filter)" for performance */`
			`struct git_istream *nst = attach_stream_filter(st, filter);`
open_istream(): do not dereference NULL in the error case When stream-filter cannot be attached, it is expected to return NULL, and we should close the stream we opened and signal an error by returning NULL ourselves from this function. However, we attempted to dereference that NULL pointer between the point we detected the error and returned from the function. Brought-to-attention-by: John Keeping <john@keeping.me.uk> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2014-02-19 01:00:53 +01:00			`if (!nst) {`
Add streaming filter API This introduces an API to plug custom filters to an input stream. The caller gets get_stream_filter("path") to obtain an appropriate filter for the path, and then uses it when opening an input stream via open_istream(). After that, the caller can read from the stream with read_istream(), and close it with close_istream(), just like an unfiltered stream. This only adds a "null" filter that is a pass-thru filter, but later changes can add LF-to-CRLF and other filters, and the callers of the streaming API do not have to change. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-20 23:33:31 +02:00			`close_istream(st);`
open_istream(): do not dereference NULL in the error case When stream-filter cannot be attached, it is expected to return NULL, and we should close the stream we opened and signal an error by returning NULL ourselves from this function. However, we attempted to dereference that NULL pointer between the point we detected the error and returned from the function. Brought-to-attention-by: John Keeping <john@keeping.me.uk> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2014-02-19 01:00:53 +01:00			`return NULL;`
			`}`
Add streaming filter API This introduces an API to plug custom filters to an input stream. The caller gets get_stream_filter("path") to obtain an appropriate filter for the path, and then uses it when opening an input stream via open_istream(). After that, the caller can read from the stream with read_istream(), and close it with close_istream(), just like an unfiltered stream. This only adds a "null" filter that is a pass-thru filter, but later changes can add LF-to-CRLF and other filters, and the callers of the streaming API do not have to change. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-20 23:33:31 +02:00			`st = nst;`
			`}`

streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`*size = st->size;`
			`return st;`
			`}`

streaming: read non-delta incrementally from a pack Helped-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-14 00:34:58 +02:00
			`/*****************************************************************`
			`*`
			`* Common helpers`
			`*`
			`*****************************************************************/`

			`static void close_deflated_stream(struct git_istream *st)`
			`{`
			`if (st->z_state == z_used)`
			`git_inflate_end(&st->z);`
			`}`


Add streaming filter API This introduces an API to plug custom filters to an input stream. The caller gets get_stream_filter("path") to obtain an appropriate filter for the path, and then uses it when opening an input stream via open_istream(). After that, the caller can read from the stream with read_istream(), and close it with close_istream(), just like an unfiltered stream. This only adds a "null" filter that is a pass-thru filter, but later changes can add LF-to-CRLF and other filters, and the callers of the streaming API do not have to change. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-20 23:33:31 +02:00			`/*****************************************************************`
			`*`
			`* Filtered stream`
			`*`
			`*****************************************************************/`

			`static close_method_decl(filtered)`
			`{`
			`free_stream_filter(st->u.filtered.filter);`
			`return close_istream(st->u.filtered.upstream);`
			`}`

			`static read_method_decl(filtered)`
			`{`
			`struct filtered_istream *fs = &(st->u.filtered);`
			`size_t filled = 0;`

			`while (sz) {`
			`/* do we already have filtered output? */`
			`if (fs->o_ptr < fs->o_end) {`
			`size_t to_move = fs->o_end - fs->o_ptr;`
			`if (sz < to_move)`
			`to_move = sz;`
			`memcpy(buf + filled, fs->obuf + fs->o_ptr, to_move);`
			`fs->o_ptr += to_move;`
			`sz -= to_move;`
			`filled += to_move;`
			`continue;`
			`}`
			`fs->o_end = fs->o_ptr = 0;`

			`/* do we have anything to feed the filter with? */`
			`if (fs->i_ptr < fs->i_end) {`
			`size_t to_feed = fs->i_end - fs->i_ptr;`
			`size_t to_receive = FILTER_BUFFER;`
			`if (stream_filter(fs->filter,`
			`fs->ibuf + fs->i_ptr, &to_feed,`
			`fs->obuf, &to_receive))`
			`return -1;`
			`fs->i_ptr = fs->i_end - to_feed;`
			`fs->o_end = FILTER_BUFFER - to_receive;`
			`continue;`
			`}`
stream filter: add "no more input" to the filters Some filters may need to buffer the input and look-ahead inside it to decide what to output, and they may consume more than zero bytes of input and still not produce any output. After feeding all the input, pass NULL as input as keep calling stream_filter() to let such filters know there is no more input coming, and it is time for them to produce the remaining output based on the buffered input. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-21 23:05:51 +02:00
			`/* tell the filter to drain upon no more input */`
			`if (fs->input_finished) {`
			`size_t to_receive = FILTER_BUFFER;`
			`if (stream_filter(fs->filter,`
			`NULL, NULL,`
			`fs->obuf, &to_receive))`
			`return -1;`
			`fs->o_end = FILTER_BUFFER - to_receive;`
			`if (!fs->o_end)`
			`break;`
			`continue;`
			`}`
Add streaming filter API This introduces an API to plug custom filters to an input stream. The caller gets get_stream_filter("path") to obtain an appropriate filter for the path, and then uses it when opening an input stream via open_istream(). After that, the caller can read from the stream with read_istream(), and close it with close_istream(), just like an unfiltered stream. This only adds a "null" filter that is a pass-thru filter, but later changes can add LF-to-CRLF and other filters, and the callers of the streaming API do not have to change. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-20 23:33:31 +02:00			`fs->i_end = fs->i_ptr = 0;`

			`/* refill the input from the upstream */`
stream filter: add "no more input" to the filters Some filters may need to buffer the input and look-ahead inside it to decide what to output, and they may consume more than zero bytes of input and still not produce any output. After feeding all the input, pass NULL as input as keep calling stream_filter() to let such filters know there is no more input coming, and it is time for them to produce the remaining output based on the buffered input. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-21 23:05:51 +02:00			`if (!fs->input_finished) {`
			`fs->i_end = read_istream(fs->upstream, fs->ibuf, FILTER_BUFFER);`
			`if (fs->i_end < 0)`
read_istream_filtered: propagate read error from upstream The filter istream pulls data from an "upstream" stream, running it through a filter function. However, we did not properly notice when the upstream filter yielded an error, and just returned what we had read. Instead, we should propagate the error. Signed-off-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2013-03-25 21:18:16 +01:00			`return -1;`
stream filter: add "no more input" to the filters Some filters may need to buffer the input and look-ahead inside it to decide what to output, and they may consume more than zero bytes of input and still not produce any output. After feeding all the input, pass NULL as input as keep calling stream_filter() to let such filters know there is no more input coming, and it is time for them to produce the remaining output based on the buffered input. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-21 23:05:51 +02:00			`if (fs->i_end)`
			`continue;`
			`}`
			`fs->input_finished = 1;`
Add streaming filter API This introduces an API to plug custom filters to an input stream. The caller gets get_stream_filter("path") to obtain an appropriate filter for the path, and then uses it when opening an input stream via open_istream(). After that, the caller can read from the stream with read_istream(), and close it with close_istream(), just like an unfiltered stream. This only adds a "null" filter that is a pass-thru filter, but later changes can add LF-to-CRLF and other filters, and the callers of the streaming API do not have to change. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-20 23:33:31 +02:00			`}`
			`return filled;`
			`}`

			`static struct stream_vtbl filtered_vtbl = {`
			`close_istream_filtered,`
			`read_istream_filtered,`
			`};`

			`static struct git_istream attach_stream_filter(struct git_istream st,`
			`struct stream_filter *filter)`
			`{`
			`struct git_istream ifs = xmalloc(sizeof(ifs));`
			`struct filtered_istream *fs = &(ifs->u.filtered);`

			`ifs->vtbl = &filtered_vtbl;`
			`fs->upstream = st;`
			`fs->filter = filter;`
			`fs->i_end = fs->i_ptr = 0;`
			`fs->o_end = fs->o_ptr = 0;`
stream filter: add "no more input" to the filters Some filters may need to buffer the input and look-ahead inside it to decide what to output, and they may consume more than zero bytes of input and still not produce any output. After feeding all the input, pass NULL as input as keep calling stream_filter() to let such filters know there is no more input coming, and it is time for them to produce the remaining output based on the buffered input. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-21 23:05:51 +02:00			`fs->input_finished = 0;`
Add streaming filter API This introduces an API to plug custom filters to an input stream. The caller gets get_stream_filter("path") to obtain an appropriate filter for the path, and then uses it when opening an input stream via open_istream(). After that, the caller can read from the stream with read_istream(), and close it with close_istream(), just like an unfiltered stream. This only adds a "null" filter that is a pass-thru filter, but later changes can add LF-to-CRLF and other filters, and the callers of the streaming API do not have to change. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-20 23:33:31 +02:00			`ifs->size = -1; /* unknown */`
			`return ifs;`
			`}`

streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`/*****************************************************************`
			`*`
			`* Loose object stream`
			`*`
			`*****************************************************************/`

streaming: read loose objects incrementally Helped-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-15 04:17:10 +02:00			`static read_method_decl(loose)`
			`{`
			`size_t total_read = 0;`

			`switch (st->z_state) {`
			`case z_done:`
			`return 0;`
			`case z_error:`
			`return -1;`
			`default:`
			`break;`
			`}`

			`if (st->u.loose.hdr_used < st->u.loose.hdr_avail) {`
			`size_t to_copy = st->u.loose.hdr_avail - st->u.loose.hdr_used;`
			`if (sz < to_copy)`
			`to_copy = sz;`
			`memcpy(buf, st->u.loose.hdr + st->u.loose.hdr_used, to_copy);`
			`st->u.loose.hdr_used += to_copy;`
			`total_read += to_copy;`
			`}`

			`while (total_read < sz) {`
			`int status;`

			`st->z.next_out = (unsigned char *)buf + total_read;`
			`st->z.avail_out = sz - total_read;`
			`status = git_inflate(&st->z, Z_FINISH);`

			`total_read = st->z.next_out - (unsigned char *)buf;`

			`if (status == Z_STREAM_END) {`
			`git_inflate_end(&st->z);`
			`st->z_state = z_done;`
			`break;`
			`}`
avoid infinite loop in read_istream_loose The read_istream_loose function loops on inflating a chunk of data from an mmap'd loose object. We end the loop when we run out of space in our output buffer, or if we see a zlib error. We need to treat Z_BUF_ERROR specially, though, as it is not fatal; it is just zlib's way of telling us that we need to either feed it more input or give it more output space. It is perfectly normal for us to hit this when we are at the end of our buffer. However, we may also get Z_BUF_ERROR because we have run out of input. In a well-formed object, this should not happen, because we have fed the whole mmap'd contents to zlib. But if the object is truncated or corrupt, we will loop forever, never giving zlib any more data, but continuing to ask it to inflate. We can fix this by considering it an error when zlib returns Z_BUF_ERROR but we still have output space left (which means it must want more input, which we know is a truncation error). It would not be sufficient to just check whether zlib had consumed all the input at the start of the loop, as it might still want to generate output from what is in its internal state. Signed-off-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2013-03-25 21:21:14 +01:00			`if (status != Z_OK && (status != Z_BUF_ERROR \|\| total_read < sz)) {`
streaming: read loose objects incrementally Helped-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-15 04:17:10 +02:00			`git_inflate_end(&st->z);`
			`st->z_state = z_error;`
			`return -1;`
			`}`
			`}`
			`return total_read;`
			`}`

			`static close_method_decl(loose)`
			`{`
			`close_deflated_stream(st);`
			`munmap(st->u.loose.mapped, st->u.loose.mapsize);`
			`return 0;`
			`}`

			`static struct stream_vtbl loose_vtbl = {`
			`close_istream_loose,`
			`read_istream_loose,`
			`};`

streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`static open_method_decl(loose)`
			`{`
sha1_file: add repository argument to map_sha1_file Add a repository argument to allow map_sha1_file callers to be more specific about which repository to handle. This is a small mechanical change; it doesn't change the implementation to handle repositories other than the_repository yet. As with the previous commits, use a macro to catch callers passing a repository other than the_repository at compile time. While at it, move the declaration to object-store.h, where it should be easier to find. Signed-off-by: Stefan Beller <sbeller@google.com> Signed-off-by: Jonathan Nieder <jrnieder@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> Signed-off-by: Nguyễn Thái Ngọc Duy <pclouds@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2018-03-23 18:21:14 +01:00			`st->u.loose.mapped = map_sha1_file(the_repository,`
Merge branch 'sb/object-store' Refactoring the internal global data structure to make it possible to open multiple repositories, work with and then close them. Rerolled by Duy on top of a separate preliminary clean-up topic. The resulting structure of the topics looked very sensible. * sb/object-store: (27 commits) sha1_file: allow sha1_loose_object_info to handle arbitrary repositories sha1_file: allow map_sha1_file to handle arbitrary repositories sha1_file: allow map_sha1_file_1 to handle arbitrary repositories sha1_file: allow open_sha1_file to handle arbitrary repositories sha1_file: allow stat_sha1_file to handle arbitrary repositories sha1_file: allow sha1_file_name to handle arbitrary repositories sha1_file: add repository argument to sha1_loose_object_info sha1_file: add repository argument to map_sha1_file sha1_file: add repository argument to map_sha1_file_1 sha1_file: add repository argument to open_sha1_file sha1_file: add repository argument to stat_sha1_file sha1_file: add repository argument to sha1_file_name sha1_file: allow prepare_alt_odb to handle arbitrary repositories sha1_file: allow link_alt_odb_entries to handle arbitrary repositories sha1_file: add repository argument to prepare_alt_odb sha1_file: add repository argument to link_alt_odb_entries sha1_file: add repository argument to read_info_alternates sha1_file: add repository argument to link_alt_odb_entry sha1_file: add raw_object_store argument to alt_odb_usable pack: move approximate object count to object store ... 2018-04-11 06:09:55 +02:00			`oid->hash, &st->u.loose.mapsize);`
streaming: read loose objects incrementally Helped-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-15 04:17:10 +02:00			`if (!st->u.loose.mapped)`
			`return -1;`
streaming: make sure to notice corrupt object The streaming read interface from a loose object called parse_sha1_header() but discarded its return value, without noticing a potential error. Signed-off-by: Junio C Hamano <gitster@pobox.com> 2016-09-26 18:23:41 +02:00			`if ((unpack_sha1_header(&st->z,`
			`st->u.loose.mapped,`
			`st->u.loose.mapsize,`
			`st->u.loose.hdr,`
			`sizeof(st->u.loose.hdr)) < 0) \|\|`
			`(parse_sha1_header(st->u.loose.hdr, &st->size) < 0)) {`
streaming: read loose objects incrementally Helped-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-15 04:17:10 +02:00			`git_inflate_end(&st->z);`
			`munmap(st->u.loose.mapped, st->u.loose.mapsize);`
			`return -1;`
			`}`

			`st->u.loose.hdr_used = strlen(st->u.loose.hdr) + 1;`
			`st->u.loose.hdr_avail = st->z.total_out;`
			`st->z_state = z_used;`

			`st->vtbl = &loose_vtbl;`
			`return 0;`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`}`


			`/*****************************************************************`
			`*`
			`* Non-delta packed object stream`
			`*`
			`*****************************************************************/`

streaming: read non-delta incrementally from a pack Helped-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-14 00:34:58 +02:00			`static read_method_decl(pack_non_delta)`
			`{`
			`size_t total_read = 0;`

			`switch (st->z_state) {`
			`case z_unused:`
			`memset(&st->z, 0, sizeof(st->z));`
			`git_inflate_init(&st->z);`
			`st->z_state = z_used;`
			`break;`
			`case z_done:`
			`return 0;`
			`case z_error:`
			`return -1;`
			`case z_used:`
			`break;`
			`}`

			`while (total_read < sz) {`
			`int status;`
			`struct pack_window *window = NULL;`
			`unsigned char *mapped;`

			`mapped = use_pack(st->u.in_pack.pack, &window,`
			`st->u.in_pack.pos, &st->z.avail_in);`

			`st->z.next_out = (unsigned char *)buf + total_read;`
			`st->z.avail_out = sz - total_read;`
			`st->z.next_in = mapped;`
			`status = git_inflate(&st->z, Z_FINISH);`

			`st->u.in_pack.pos += st->z.next_in - mapped;`
			`total_read = st->z.next_out - (unsigned char *)buf;`
			`unuse_pack(&window);`

			`if (status == Z_STREAM_END) {`
			`git_inflate_end(&st->z);`
			`st->z_state = z_done;`
			`break;`
			`}`
			`if (status != Z_OK && status != Z_BUF_ERROR) {`
			`git_inflate_end(&st->z);`
			`st->z_state = z_error;`
			`return -1;`
			`}`
			`}`
			`return total_read;`
			`}`

			`static close_method_decl(pack_non_delta)`
			`{`
			`close_deflated_stream(st);`
			`return 0;`
			`}`

			`static struct stream_vtbl pack_non_delta_vtbl = {`
			`close_istream_pack_non_delta,`
			`read_istream_pack_non_delta,`
			`};`

streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`static open_method_decl(pack_non_delta)`
			`{`
streaming: read non-delta incrementally from a pack Helped-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-14 00:34:58 +02:00			`struct pack_window *window;`
			`enum object_type in_pack_type;`

			`st->u.in_pack.pack = oi->u.packed.pack;`
			`st->u.in_pack.pos = oi->u.packed.offset;`
			`window = NULL;`

			`in_pack_type = unpack_object_header(st->u.in_pack.pack,`
			`&window,`
			`&st->u.in_pack.pos,`
			`&st->size);`
			`unuse_pack(&window);`
			`switch (in_pack_type) {`
			`default:`
			`return -1; /* we do not do deltas for now */`
			`case OBJ_COMMIT:`
			`case OBJ_TREE:`
			`case OBJ_BLOB:`
			`case OBJ_TAG:`
			`break;`
			`}`
			`st->z_state = z_unused;`
			`st->vtbl = &pack_non_delta_vtbl;`
			`return 0;`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`}`


			`/*****************************************************************`
			`*`
			`* In-core stream`
			`*`
			`*****************************************************************/`

			`static close_method_decl(incore)`
			`{`
			`free(st->u.incore.buf);`
			`return 0;`
			`}`

			`static read_method_decl(incore)`
			`{`
			`size_t read_size = sz;`
			`size_t remainder = st->size - st->u.incore.read_ptr;`

			`if (remainder <= read_size)`
			`read_size = remainder;`
			`if (read_size) {`
			`memcpy(buf, st->u.incore.buf + st->u.incore.read_ptr, read_size);`
			`st->u.incore.read_ptr += read_size;`
			`}`
			`return read_size;`
			`}`

			`static struct stream_vtbl incore_vtbl = {`
			`close_istream_incore,`
			`read_istream_incore,`
			`};`

			`static open_method_decl(incore)`
			`{`
sha1_file: convert read_sha1_file to struct object_id Convert read_sha1_file to take a pointer to struct object_id and rename it read_object_file. Do the same for read_sha1_file_extended. Convert one use in grep.c to use the new function without any other code change, since the pointer being passed is a void pointer that is already initialized with a pointer to struct object_id. Update the declaration and definitions of the modified functions, and apply the following semantic patch to convert the remaining callers: @@ expression E1, E2, E3; @@ - read_sha1_file(E1.hash, E2, E3) + read_object_file(&E1, E2, E3) @@ expression E1, E2, E3; @@ - read_sha1_file(E1->hash, E2, E3) + read_object_file(E1, E2, E3) @@ expression E1, E2, E3, E4; @@ - read_sha1_file_extended(E1.hash, E2, E3, E4) + read_object_file_extended(&E1, E2, E3, E4) @@ expression E1, E2, E3, E4; @@ - read_sha1_file_extended(E1->hash, E2, E3, E4) + read_object_file_extended(E1, E2, E3, E4) Signed-off-by: brian m. carlson <sandals@crustytoothpaste.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2018-03-12 03:27:53 +01:00			`st->u.incore.buf = read_object_file_extended(oid, type, &st->size, 0);`
streaming: a new API to read from the object store Given an object name, use open_istream() to get a git_istream handle that you can read_istream() from as if you are using read(2) to read the contents of the object, and close it with close_istream() when you are done. Currently, we do not do anything fancy--it just calls read_sha1_file() and keeps the contents in memory as a whole, and carve it out as you request with read_istream(). Signed-off-by: Junio C Hamano <gitster@pobox.com> 2011-05-12 04:30:25 +02:00			`st->u.incore.read_ptr = 0;`
			`st->vtbl = &incore_vtbl;`

			`return st->u.incore.buf ? 0 : -1;`
			`}`
streaming: make streaming-write-entry to be more reusable The static function in entry.c takes a cache entry and streams its blob contents to a file in the working tree. Refactor the logic to a new API function stream_blob_to_fd() that takes an object name and an open file descriptor, so that it can be reused by other callers. Signed-off-by: Nguyễn Thái Ngọc Duy <pclouds@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2012-03-07 11:54:15 +01:00

			`/****************************************************************`
			`* Users of streaming interface`
			`****************************************************************/`

streaming: make stream_blob_to_fd take struct object_id Since all of its callers have been updated, modify stream_blob_to_fd to take a struct object_id. Signed-off-by: brian m. carlson <sandals@crustytoothpaste.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2016-09-05 22:07:59 +02:00			`int stream_blob_to_fd(int fd, const struct object_id oid, struct stream_filter filter,`
streaming: make streaming-write-entry to be more reusable The static function in entry.c takes a cache entry and streams its blob contents to a file in the working tree. Refactor the logic to a new API function stream_blob_to_fd() that takes an object name and an open file descriptor, so that it can be reused by other callers. Signed-off-by: Nguyễn Thái Ngọc Duy <pclouds@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2012-03-07 11:54:15 +01:00			`int can_seek)`
			`{`
			`struct git_istream *st;`
			`enum object_type type;`
			`unsigned long sz;`
			`ssize_t kept = 0;`
			`int result = -1;`

streaming: convert open_istream to use struct object_id Signed-off-by: brian m. carlson <sandals@crustytoothpaste.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2018-03-12 03:27:40 +01:00			`st = open_istream(oid, &type, &sz, filter);`
streaming.c: fix a memleak When stream_blob_to_fd() opens an input stream with a filter, the filter gets discarded upon calling close_istream() before the function returns in the normal case. However, when we fail to open the stream, we failed to discard the filter. By discarding the filter in the failure case, give a consistent life-time rule of the filter to the callers; otherwise the callers need to conditionally discard the filter themselves, and this function does not give enough hint for the caller to do so correctly. Signed-off-by: John Keeping <john@keeping.me.uk> Signed-off-by: Stefan Beller <sbeller@google.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2015-03-31 03:22:11 +02:00			`if (!st) {`
			`if (filter)`
			`free_stream_filter(filter);`
streaming: make streaming-write-entry to be more reusable The static function in entry.c takes a cache entry and streams its blob contents to a file in the working tree. Refactor the logic to a new API function stream_blob_to_fd() that takes an object name and an open file descriptor, so that it can be reused by other callers. Signed-off-by: Nguyễn Thái Ngọc Duy <pclouds@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2012-03-07 11:54:15 +01:00			`return result;`
streaming.c: fix a memleak When stream_blob_to_fd() opens an input stream with a filter, the filter gets discarded upon calling close_istream() before the function returns in the normal case. However, when we fail to open the stream, we failed to discard the filter. By discarding the filter in the failure case, give a consistent life-time rule of the filter to the callers; otherwise the callers need to conditionally discard the filter themselves, and this function does not give enough hint for the caller to do so correctly. Signed-off-by: John Keeping <john@keeping.me.uk> Signed-off-by: Stefan Beller <sbeller@google.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2015-03-31 03:22:11 +02:00			`}`
streaming: make streaming-write-entry to be more reusable The static function in entry.c takes a cache entry and streams its blob contents to a file in the working tree. Refactor the logic to a new API function stream_blob_to_fd() that takes an object name and an open file descriptor, so that it can be reused by other callers. Signed-off-by: Nguyễn Thái Ngọc Duy <pclouds@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2012-03-07 11:54:15 +01:00			`if (type != OBJ_BLOB)`
			`goto close_and_exit;`
			`for (;;) {`
			`char buf[1024 * 16];`
			`ssize_t wrote, holeto;`
			`ssize_t readlen = read_istream(st, buf, sizeof(buf));`

stream_blob_to_fd: detect errors reading from stream We call read_istream, but never check its return value for errors. This can lead to us looping infinitely, as we just keep trying to write "-1" bytes (and we do not notice the error, as we simply check that write_in_full reports the same number of bytes we fed it, which of course is also -1). Signed-off-by: Jeff King <peff@peff.net> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2013-03-25 21:16:50 +01:00			`if (readlen < 0)`
			`goto close_and_exit;`
streaming: make streaming-write-entry to be more reusable The static function in entry.c takes a cache entry and streams its blob contents to a file in the working tree. Refactor the logic to a new API function stream_blob_to_fd() that takes an object name and an open file descriptor, so that it can be reused by other callers. Signed-off-by: Nguyễn Thái Ngọc Duy <pclouds@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2012-03-07 11:54:15 +01:00			`if (!readlen)`
			`break;`
			`if (can_seek && sizeof(buf) == readlen) {`
			`for (holeto = 0; holeto < readlen; holeto++)`
			`if (buf[holeto])`
			`break;`
			`if (readlen == holeto) {`
			`kept += holeto;`
			`continue;`
			`}`
			`}`

			`if (kept && lseek(fd, kept, SEEK_CUR) == (off_t) -1)`
			`goto close_and_exit;`
			`else`
			`kept = 0;`
			`wrote = write_in_full(fd, buf, readlen);`

convert less-trivial versions of "write_in_full() != len" The prior commit converted many sites to check the return value of write_in_full() for negativity, rather than a mismatch with the input length. This patch covers similar cases, but where the return value is stored in an intermediate variable. These should get the same treatment, but they need to be reviewed more carefully since it would be a bug if the return value is stored in an unsigned type (which indeed, it is in one of the cases). Signed-off-by: Jeff King <peff@peff.net> Reviewed-by: Jonathan Nieder <jrnieder@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2017-09-13 19:16:28 +02:00			`if (wrote < 0)`
streaming: make streaming-write-entry to be more reusable The static function in entry.c takes a cache entry and streams its blob contents to a file in the working tree. Refactor the logic to a new API function stream_blob_to_fd() that takes an object name and an open file descriptor, so that it can be reused by other callers. Signed-off-by: Nguyễn Thái Ngọc Duy <pclouds@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2012-03-07 11:54:15 +01:00			`goto close_and_exit;`
			`}`
			`if (kept && (lseek(fd, kept - 1, SEEK_CUR) == (off_t) -1 \|\|`
prefer xwrite instead of write Our xwrite wrapper already deals with a few potential hazards, and are as such more robust. Prefer it instead of write to get the robustness benefits everywhere. Signed-off-by: Erik Faye-Lund <kusmabite@gmail.com> Reviewed-and-improved-by: Jonathan Nieder <jrnieder@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2014-01-17 15:17:09 +01:00			`xwrite(fd, "", 1) != 1))`
streaming: make streaming-write-entry to be more reusable The static function in entry.c takes a cache entry and streams its blob contents to a file in the working tree. Refactor the logic to a new API function stream_blob_to_fd() that takes an object name and an open file descriptor, so that it can be reused by other callers. Signed-off-by: Nguyễn Thái Ngọc Duy <pclouds@gmail.com> Signed-off-by: Junio C Hamano <gitster@pobox.com> 2012-03-07 11:54:15 +01:00			`goto close_and_exit;`
			`result = 0;`

			`close_and_exit:`
			`close_istream(st);`
			`return result;`
			`}`