"""
Processor class for Chameleon.
"""

from typing import Optional, Union

import numpy as np

from ...feature_extraction_utils import BatchFeature
from ...image_utils import ImageInput
from ...processing_utils import (
    MultiModalData,
    ProcessingKwargs,
    ProcessorMixin,
    TextKwargs,
    Unpack,
)
from ...tokenization_utils_base import PreTokenizedInput, TextInput


class ChameleonTextKwargs(TextKwargs, total=False):
    return_for_text_completion: bool


class ChameleonProcessorKwargs(ProcessingKwargs, total=False):
    text_kwargs: ChameleonTextKwargs
    _defaults = {
        "text_kwargs": {
            "padding": False,
            "return_for_text_completion": False,
            "return_mm_token_type_ids": False,
        },
        "common_kwargs": {
            "return_tensors": "pt",
        },
    }


class ChameleonProcessor(ProcessorMixin):
    """
    Constructs a Chameleon processor which wraps a Chameleon image processor and a Chameleon tokenizer into a single
    processor.

    [`ChameleonProcessor`] offers all the functionalities of [`ChameleonImageProcessor`] and [`LlamaTokenizerFast`].
    See the [`~ChameleonProcessor.__call__`] and [`~ChameleonProcessor.decode`] for more information.

    Args:
        image_processor ([`ChameleonImageProcessor`]):
            The image processor is a required input.
        tokenizer ([`LlamaTokenizerFast`]):
            The tokenizer is a required input.
        image_seq_length (`int`, *optional*, defaults to 1024):
            Sequence length of one image embedding.
        image_token (`str`, *optional*, defaults to `"<image>"`):
            The special token used to indicate image in the text.
    """

    attributes = ["image_processor", "tokenizer"]
    tokenizer_class = ("LlamaTokenizer", "LlamaTokenizerFast")
    image_processor_class = "ChameleonImageProcessor"

    def __init__(self, image_processor, tokenizer, image_seq_length: int = 1024, image_token: str = "<image>"):
        self.image_seq_length = image_seq_length
        # Prefer the special tokens defined on the tokenizer and fall back to fixed defaults.
        self.image_token = tokenizer.image_token if hasattr(tokenizer, "image_token") else image_token
        self.image_token_id = tokenizer.convert_tokens_to_ids(self.image_token)
        self.image_start_token = tokenizer.boi_token if hasattr(tokenizer, "boi_token") else "<racm3:break>"
        self.image_end_token = tokenizer.eoi_token if hasattr(tokenizer, "eoi_token") else "<eoss>"
        self.image_token_id = tokenizer.convert_tokens_to_ids(self.image_token)
        self.image_start_token_id = tokenizer.convert_tokens_to_ids(self.image_start_token)
        self.image_end_token_id = tokenizer.convert_tokens_to_ids(self.image_end_token)
        self.image_ids = [self.image_token_id, self.image_start_token_id, self.image_end_token_id]

        super().__init__(image_processor, tokenizer)

    def __call__(
        self,
        images: Optional[ImageInput] = None,
        text: Optional[Union[TextInput, PreTokenizedInput, list[TextInput], list[PreTokenizedInput]]] = None,
        audio=None,
        videos=None,
        **kwargs: Unpack[ChameleonProcessorKwargs],
    ) -> BatchFeature:
        """
        Main method to prepare for the model one or several sequence(s) and image(s). This method forwards the `text`
        and `kwargs` arguments to LlamaTokenizerFast's [`~LlamaTokenizerFast.__call__`] if `text` is not `None` to
        encode the text. To prepare the image(s), this method forwards the `images` and `kwargs` arguments to
        ChameleonImageProcessor's [`~ChameleonImageProcessor.__call__`] if `images` is not `None`. Please refer to
        the docstring of the above two methods for more information.

        Args:
            images (`PIL.Image.Image`, `np.ndarray`, `torch.Tensor`, `list[PIL.Image.Image]`, `list[np.ndarray]`, `list[torch.Tensor]`):
                The image or batch of images to be prepared. Each image can be a PIL image, NumPy array or PyTorch
                tensor. Both channels-first and channels-last formats are supported.
            text (`str`, `list[str]`, `list[list[str]]`):
                The sequence or batch of sequences to be encoded. Each sequence can be a string or a list of strings
                (pretokenized string). If the sequences are provided as list of strings (pretokenized), you must set
                `is_split_into_words=True` (to lift the ambiguity with a batch of sequences).
            return_tensors (`str` or [`~utils.TensorType`], *optional*):
                If set, will return tensors of a particular framework. Acceptable values are:

                - `'tf'`: Return TensorFlow `tf.constant` objects.
                - `'pt'`: Return PyTorch `torch.Tensor` objects.
                - `'np'`: Return NumPy `np.ndarray` objects.
                - `'jax'`: Return JAX `jnp.ndarray` objects.

        Returns:
            [`BatchFeature`]: A [`BatchFeature`] with the following fields:

            - **input_ids** -- List of token ids to be fed to a model. Returned when `text` is not `None`.
            - **attention_mask** -- List of indices specifying which tokens should be attended to by the model (when
              `return_attention_mask=True` or if *"attention_mask"* is in `self.model_input_names` and if `text` is
              not `None`).
            - **pixel_values** -- Pixel values to be fed to a model. Returned when `images` is not `None`.
        """
        if isinstance(text, str):
            text = [text]
        elif not isinstance(text, list) and not isinstance(text[0], str):
            raise TypeError("Invalid input text. Please provide a string, or a list of strings")
        if text is None and images is None:
            raise ValueError("You must provide either text or images")

        output_kwargs = self._merge_kwargs(
            ChameleonProcessorKwargs,
            tokenizer_init_kwargs=self.tokenizer.init_kwargs,
            **kwargs,
        )
        return_for_text_completion = output_kwargs["text_kwargs"].pop("return_for_text_completion", False)

        # Expand every image placeholder into the full image token sequence.
        prompt_strings = []
        one_img_tokens = self.image_start_token + (self.image_token * self.image_seq_length) + self.image_end_token
        for sample in text:
            sample = sample.replace(self.image_token, one_img_tokens)
            if not return_for_text_completion:
                # Append the separator token unless the prompt is meant for plain text completion.
                sample += self.tokenizer.sep_token
            prompt_strings.append(sample)

        image_inputs = {}
        if images is not None:
            image_inputs = self.image_processor(images, **output_kwargs["images_kwargs"])

        return_tensors = output_kwargs["text_kwargs"].pop("return_tensors", None)
        return_mm_token_type_ids = output_kwargs["text_kwargs"].pop("return_mm_token_type_ids", False)
        text_inputs = self.tokenizer(prompt_strings, **output_kwargs["text_kwargs"], return_tensors=None)
        self._check_special_mm_tokens(prompt_strings, text_inputs, modalities=["image"])

        if return_mm_token_type_ids:
            # Mark positions holding image-related token ids with 1, all others with 0.
            array_ids = np.array(text_inputs["input_ids"])
            mm_token_type_ids = np.zeros_like(text_inputs["input_ids"])
            mm_token_type_ids[np.isin(array_ids, self.image_ids)] = 1
            text_inputs["mm_token_type_ids"] = mm_token_type_ids.tolist()

        return BatchFeature(data={**text_inputs, **image_inputs}, tensor_type=return_tensors)

    def _get_num_multimodal_tokens(self, image_sizes=None, **kwargs):
        """
        Computes the number of placeholder tokens needed for multimodal inputs with the given sizes.

        Args:
            image_sizes (`list[list[int]]`, *optional*):
                The input sizes formatted as (height, width) per each image.

        Returns:
            `MultiModalData`: A `MultiModalData` object holding number of tokens per each of the provided
            input modalities, along with other useful data.
        """
        vision_data = {}
        if image_sizes is not None:
            # Each image contributes `image_seq_length` tokens plus the start and end tokens.
            num_image_tokens = [self.image_seq_length + 2] * len(image_sizes)
            num_image_patches = [1] * len(image_sizes)

            vision_data.update({"num_image_tokens": num_image_tokens, "num_image_patches": num_image_patches})

        return MultiModalData(**vision_data)

    def batch_decode(self, *args, **kwargs):
        """
        This method forwards all its arguments to LlamaTokenizerFast's [`~PreTrainedTokenizer.batch_decode`]. Please
        refer to the docstring of this method for more information.
        """
        return self.tokenizer.batch_decode(*args, **kwargs)

    def decode(self, *args, **kwargs):
        """
        This method forwards all its arguments to LlamaTokenizerFast's [`~PreTrainedTokenizer.decode`]. Please refer to
        the docstring of this method for more information.
        """
        return self.tokenizer.decode(*args, **kwargs)

    @property
    def model_input_names(self):
        tokenizer_input_names = self.tokenizer.model_input_names
        image_processor_input_names = self.image_processor.model_input_names
        # dict.fromkeys removes duplicates while preserving order.
        return list(dict.fromkeys(tokenizer_input_names + image_processor_input_names))


__all__ = ["ChameleonProcessor"]
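
# Illustrative sketch (not part of the module): the placeholder expansion and the
# `mm_token_type_ids` marking performed in `ChameleonProcessor.__call__` can be mimicked
# in plain Python, without a tokenizer. The helper names below and the tokens "<img>",
# "<boi>", "<eoi>" are hypothetical stand-ins chosen for the example; the token ids are
# made up as well. It only aims to show the string and id logic under those assumptions.

```python
def expand_image_placeholders(prompts, image_token, start_token, end_token,
                              image_seq_length, sep_token=None):
    """Replace each image placeholder with start + placeholder * N + end."""
    one_img_tokens = start_token + image_token * image_seq_length + end_token
    expanded = []
    for sample in prompts:
        # str.replace scans the original string once, so the placeholders that
        # appear inside `one_img_tokens` are not expanded a second time.
        sample = sample.replace(image_token, one_img_tokens)
        if sep_token is not None:
            # Mirrors the separator appended when the prompt is not a plain
            # text completion.
            sample += sep_token
        expanded.append(sample)
    return expanded


def mark_image_positions(input_ids, image_ids):
    """Return 1 where a token id belongs to `image_ids`, else 0."""
    image_ids = set(image_ids)
    return [[1 if tok in image_ids else 0 for tok in row] for row in input_ids]


prompts = expand_image_placeholders(
    ["Describe <img> briefly."], "<img>", "<boi>", "<eoi>", image_seq_length=2
)
type_ids = mark_image_positions([[5, 7, 9, 7]], image_ids=[7, 8])
```

# With `image_seq_length=2`, a single placeholder grows into four tokens in total
# (start, two placeholders, end), which matches the `image_seq_length + 2` count used
# by `_get_num_multimodal_tokens`.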