BARALDI, LORENZO
 Geographic distribution
Continent #
NA - North America 16.509
AS - Asia 12.270
EU - Europe 10.574
SA - South America 1.118
AF - Africa 197
OC - Oceania 68
Unknown continent - Continent information not available 24
Total 40.760
Country #
US - United States of America 16.058
IT - Italy 4.802
SG - Singapore 3.382
CN - China 3.028
GB - United Kingdom 1.736
HK - Hong Kong 1.429
VN - Vietnam 1.080
DE - Germany 881
BR - Brazil 849
TR - Turkey 825
SE - Sweden 633
KR - Korea 514
FR - France 425
FI - Finland 364
RU - Russian Federation 335
ID - Indonesia 316
JP - Japan 312
BD - Bangladesh 278
IN - India 274
CA - Canada 260
NL - Netherlands 248
UA - Ukraine 177
ES - Spain 164
MX - Mexico 130
TW - Taiwan 119
AT - Austria 107
IE - Ireland 104
MY - Malaysia 100
BG - Bulgaria 93
TH - Thailand 89
AR - Argentina 88
IQ - Iraq 82
BE - Belgium 76
PL - Poland 72
CH - Switzerland 71
PH - Philippines 67
AU - Australia 59
ZA - South Africa 56
PK - Pakistan 51
RO - Romania 47
AE - United Arab Emirates 42
SA - Saudi Arabia 42
LT - Lithuania 40
EC - Ecuador 39
GR - Greece 39
CL - Chile 38
IL - Israel 38
DK - Denmark 34
PT - Portugal 31
CO - Colombia 29
KE - Kenya 27
VE - Venezuela 27
UZ - Uzbekistan 26
TN - Tunisia 23
DZ - Algeria 20
JO - Jordan 20
MA - Morocco 20
NP - Nepal 20
EU - Europe 19
IR - Iran 19
CZ - Czech Republic 18
PE - Peru 18
KZ - Kazakhstan 17
EG - Egypt 16
JM - Jamaica 15
AZ - Azerbaijan 13
PY - Paraguay 13
ET - Ethiopia 10
SY - Syrian Arab Republic 10
OM - Oman 9
UY - Uruguay 9
AL - Albania 8
BZ - Belize 8
HU - Hungary 8
KH - Cambodia 8
LU - Luxembourg 8
NZ - New Zealand 8
BH - Bahrain 7
HR - Croatia 7
MO - Macao, Special Administrative Region of China 7
RS - Serbia 7
SC - Seychelles 7
SK - Slovakia (Slovak Republic) 7
BB - Barbados 6
BO - Bolivia 6
CR - Costa Rica 6
CY - Cyprus 6
EE - Estonia 6
KG - Kyrgyzstan 6
MD - Moldova 6
NO - Norway 6
DO - Dominican Republic 5
HN - Honduras 5
KW - Kuwait 5
SN - Senegal 5
BA - Bosnia and Herzegovina 4
LB - Lebanon 4
LK - Sri Lanka 4
LV - Latvia 4
PS - Palestinian Territory 4
Total 40.700
City #
Singapore 2.142
Ashburn 1.646
Santa Clara 1.399
Fairfield 1.362
Hong Kong 1.199
Southend 958
Modena 901
San Jose 865
Hefei 856
Chandler 774
Elâzığ 670
Woodbridge 668
Seattle 614
Houston 595
Beijing 521
Cambridge 520
Wilmington 444
Ann Arbor 384
London 353
Seoul 353
Los Angeles 345
Ho Chi Minh City 342
Nyköping 342
Milan 320
Bologna 291
Jakarta 261
Council Bluffs 250
Dearborn 247
Hanoi 239
Jacksonville 237
Helsinki 235
New York 230
Buffalo 216
The Dalles 205
Rome 203
Chicago 197
Boardman 193
Tokyo 152
Reggio Emilia 140
San Diego 136
Lauterbourg 127
Munich 127
Parma 113
Nuremberg 100
Shanghai 97
Princeton 89
Sofia 87
Amsterdam 84
São Paulo 82
Bangkok 81
Dallas 79
Frankfurt am Main 79
Orem 79
Redwood City 79
Dublin 75
Florence 73
Izmir 73
Kent 73
Montreal 73
Dong Ket 69
Phoenix 68
Mexico City 63
Moscow 63
Salt Lake City 60
Eugene 59
Pisa 59
Naples 58
Manchester 57
Chennai 55
Taipei 54
Toronto 53
Bomporto 50
Bremen 49
Da Nang 48
Vienna 48
Haiphong 47
Paris 47
Kuala Selangor 46
Formigine 45
Warsaw 45
Atlanta 44
Falkenstein 44
Zurich 44
Brussels 43
Manila 42
Columbus 40
Turin 39
Fremont 36
Lappeenranta 35
Ottawa 35
Guangzhou 33
Johannesburg 33
Falls Church 32
San Francisco 32
Trento 32
Wilmette 32
Tampa 31
Denver 30
Düsseldorf 30
Nanjing 29
Total 25.534
Name #
What was Monet seeing while painting? Translating artworks to photo-realistic images 650
Spaghetti Labeling: Directed Acyclic Graphs for Block-Based Connected Components Labeling 647
MissRAG: Addressing the Missing Modality Challenge in Multimodal Large Language Models 613
Connected Components Labeling on DRAGs 601
Attentive Models in Vision: Computing Saliency Maps in the Deep Learning Era 584
Visual-Semantic Alignment Across Domains Using a Semi-Supervised Approach 557
Safe-CLIP: Removing NSFW Concepts from Vision-and-Language Models 525
Attentive Models in Vision: Computing Saliency Maps in the Deep Learning Era 518
Towards Cycle-Consistent Models for Text and Image Retrieval 499
Artpedia: A New Visual-Semantic Dataset with Visual and Contextual Sentences in the Artistic Domain 478
Modeling Multimodal Cues in a Deep Learning-based Framework for Emotion Recognition in the Wild 471
Automatic Image Cropping and Selection using Saliency: an Application to Historical Manuscripts 469
Connected Components Labeling on DRAGs: Implementation and Reproducibility Notes 459
YACCLAB - Yet Another Connected Components Labeling Benchmark 436
M-VAD Names: a Dataset for Video Captioning with Naming 430
Show, Control and Tell: A Framework for Generating Controllable and Grounded Captions 423
Learning to Read L'Infinito: Handwritten Text Recognition with Synthetic Training Data 421
Layout analysis and content classification in digitized books 418
Explaining Digital Humanities by Aligning Images and Textual Descriptions 415
Aligning Text and Document Illustrations: towards Visually Explainable Digital Humanities 410
A Deep Multi-Level Network for Saliency Prediction 409
Watch Your Strokes: Improving Handwritten Text Recognition with Deformable Convolutions 407
A Hierarchical Quasi-Recurrent approach to Video Captioning 391
Predicting Human Eye Fixations via an LSTM-based Saliency Attentive Model 390
A Browsing and Retrieval System for Broadcast Videos using Scene Detection and Automatic Annotation 390
Recognizing social relationships from an egocentric vision perspective 386
Optimized Connected Components Labeling with Pixel Prediction 381
A Deep Siamese Network for Scene Detection in Broadcast Videos 380
A Video Library System Using Scene Detection and Automatic Tagging 377
Unveiling the Impact of Image Transformations on Deepfake Detection: An Experimental Analysis 376
Historical Document Digitization through Layout Analysis and Deep Content Classification 373
Context Change Detection for an Ultra-Low Power Low-Resolution Ego-Vision Imager 371
Analysis and Re-use of Videos in Educational Digital Libraries with Automatic Scene Detection 370
Image-to-Image Translation to Unfold the Reality of Artworks: an Empirical Analysis 370
Hierarchical Boundary-Aware Neural Encoder for Video Captioning 367
Wiki-LLaVA: Hierarchical Retrieval-Augmented Generation for Multimodal LLMs 365
Shot and Scene Detection via Hierarchical Clustering for Re-using Broadcast Video 364
Hand Segmentation for Gesture Recognition in EGO-Vision 362
Art2Real: Unfolding the Reality of Artworks via Semantically-Aware Image-to-Image Translation 361
Positive-Augmented Contrastive Learning for Image and Video Captioning Evaluation 357
Measuring scene detection performance 355
Dual-Branch Collaborative Transformer for Virtual Try-On 354
The Revolution of Multimodal Large Language Models: A Survey 353
SynthCap: Augmenting Transformers with Synthetic Data for Image Captioning 352
Recognizing and Presenting the Storytelling Video Structure with Deep Multimodal Networks 351
Gesture Recognition using Wearable Vision Sensors to Enhance Visitors' Museum Experiences 351
Ai4ar: An ai-based mobile application for the automatic generation of ar contents 350
SAM: Pushing the Limits of Saliency Prediction Models 342
Gesture Recognition in Ego-Centric Videos using Dense Trajectories and Hand Segmentation 341
Visual Saliency for Image Captioning in New Multimedia Services 339
Multi-Level Net: a Visual Saliency Prediction Model 334
LAMV: Learning to align and match videos with kernelized temporal layers 330
Explore and Explain: Self-supervised Navigation and Recounting 322
From Show to Tell: A Survey on Deep Learning-based Image Captioning 322
A Novel Attention-based Aggregation Function to Combine Vision and Language 320
Tracing Information Flow in LLaMA Vision: A Step Toward Multimodal Understanding 317
Multimodal Attention Networks for Low-Level Vision-and-Language Navigation 310
Paying More Attention to Saliency: Image Captioning with Saliency and Context Attention 304
Scene segmentation using temporal clustering for accessing and re-using broadcast video 303
Scene-driven Retrieval in Edited Videos using Aesthetic and Semantic Deep Features 303
Towards Video Captioning with Naming: a Novel Dataset and a Multi-Modal Approach 301
Meshed-Memory Transformer for Image Captioning 296
CaMEL: Mean Teacher Learning for Image Captioning 292
Embodied Agents for Efficient Exploration and Smart Scene Description 290
Towards Reliable Experiments on the Performance of Connected Components Labeling Algorithms 290
Boosting Modern and Historical Handwritten Text Recognition with Deformable Convolutions 289
Recurrence-Enhanced Vision-and-Language Transformers for Robust Multimodal Document Retrieval 287
Contrasting Deepfakes Diffusion via Contrastive Learning and Global-Local Similarities 283
A Computational Approach for Progressive Architecture Shrinkage in Action Recognition 277
A Unified Cycle-Consistent Neural Model for Text and Image Retrieval 275
Video action detection by learning graph-based spatio-temporal interactions 268
Investigating Bidimensional Downsampling in Vision Transformer Models 268
A Deep-learning-based approach to VM behavior Identification in Cloud Systems 267
NeuralStory: an Interactive Multimedia System for Video Indexing and Re-use 264
Retrieval-Augmented Transformer for Image Captioning 263
Adapt to Scarcity: Few-Shot Deepfake Detection via Low-Rank Adaptation 262
Semantically Conditioned Prompts for Visual Recognition under Missing Modality Scenarios 246
Embodied Vision-and-Language Navigation with Dynamic Convolutional Filters 246
Intelligent Multimodal Artificial Agents that Talk and Express Emotions 245
Focus on Impact: Indoor Exploration with Intrinsic Motivation 243
Embodied Navigation at the Art Gallery 241
Learning to Select: A Fully Attentive Approach for Novel Object Captioning 240
Revisiting The Evaluation of Class Activation Mapping for Explainability: A Novel Metric and Experimental Analysis 238
Assessing the Role of Boundary-level Objectives in Indoor Semantic Segmentation 238
Hyperbolic Safety-Aware Vision-Language Models 238
The Unreasonable Effectiveness of CLIP features for Image Captioning: an Experimental Analysis 236
Matching Faces and Attributes Between the Artistic and the Real Domain: the PersonArt Approach 234
RMS-Net: Regression and Masking for Soccer Event Spotting 233
Improving Indoor Semantic Segmentation with Boundary-level Objectives 233
BRIDGE: Bridging Gaps in Image Captioning Evaluation with Stronger Visual Cues 229
Are Learnable Prompts the Right Way of Prompting? Adapting Vision-and-Language Models with Memory Optimization 228
FOSSIL: Free Open-Vocabulary Semantic Segmentation through Synthetic References Retrieval 225
Estimating (and fixing) the Effect of Face Obfuscation in Video Recognition 221
Verifier Matters: Enhancing Inference-Time Scaling for Video Diffusion Models 218
Towards Explainable Navigation and Recounting 218
ALADIN: Distilling Fine-grained Alignment Scores for Efficient Image-Text Matching and Retrieval 218
Multimodal Emotion Recognition in Conversation via Possible Speaker's Audio and Visual Sequence Selection 214
With a Little Help from your own Past: Prototypical Memory Networks for Image Captioning 213
The LAM Dataset: A Novel Benchmark for Line-Level Handwritten Text Recognition 213
Working Memory Connections for LSTM 210
Total 34.214
Category #
all - all 138.644
article - articles 0
book - books 0
conference - conferences 0
curatela - edited volumes 0
other - other 0
patent - patents 0
selected - selected 0
volume - volumes 0
Total 138.644


Year Total Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun
2021/2022 3.237 183 139 240 169 84 214 186 230 311 335 825 321
2022/2023 2.767 364 299 246 227 301 283 92 226 386 75 144 124
2023/2024 2.525 243 170 246 312 439 164 120 183 70 201 131 246
2024/2025 8.439 710 242 260 496 1.229 925 512 650 1.081 567 803 964
2025/2026 15.568 1.084 780 1.254 1.506 2.128 1.051 1.925 1.288 1.290 1.678 878 706
2026/2027 85 85 0 0 0 0 0 0 0 0 0 0 0
Total 41.383