Tag Archives: Florence

Supervised graduate students

I have supervised several graduate students (Laëtitia Letoupin, Antonin Molières, Alexis Malécot, Ho Tien Lam) for the development of a Video Indexing tool in C++/Qt while doing my PhD in Bordeaux.

Since joining the MICC, I have supervised multiple italian students on different projects:

  • Alessio Benevieri, Railway Wagon Id Automatic recognition
  • Federico Bartoli, Master thesis “Fast pedestrian detection via geometric and Soft Cascade approximation” [1]

  • Claudio Tortorici, Master thesis “Relaxed Decision Trees over multiple Taxonomies for Visual Recognition”

  • Giovanni Giunto, Master thesis “Towards Spatial Codebook-free Methods for Image Classification”

  • Andrea Ciolini, Master thesis “Object Detection on Low Power Devices” [2]

[1] [pdf] F. Bartoli, G. Lisanti, S. Karaman, A. D. Bagdanov, and A. Del Bimbo, “Unsupervised scene adaptation for faster multi-scale pedestrian detection,” in 22nd International Conference on Pattern Recognition (ICPR), Stockholm, Sweden, 2014.
[Bibtex]
@InProceedings{bartoliicpr2014,
author = {Bartoli, Federico and Lisanti, Giuseppe and Karaman, Svebor and Bagdanov, Andrew D. and Del Bimbo, Alberto},
title = {Unsupervised scene adaptation for faster multi-scale pedestrian detection},
note = {Oral presentation},
booktitle = {22nd International Conference on Pattern Recognition (ICPR)},
address = {Stockholm, Sweden},
year = {2014}
}
[2] [pdf] A. Ciolini, L. Seidenari, S. Karaman, and A. Del Bimbo, “Efficient Hough Forest Object Detection for Low-power Devices,” in IEEE First International Workshop on Wearable and Ego-vision Systems for Augmented Experience (WEsAX), 2015.
[Bibtex]
@inproceedings{ciolini2015,
author = {Ciolini, Andrea and Seidenari, Lorenzo and Karaman, Svebor and Del Bimbo, Alberto},
title = {Efficient Hough Forest Object Detection for Low-power Devices},
booktitle = {IEEE First International Workshop on Wearable and Ego-vision Systems for Augmented Experience (WEsAX)},
year = {2015}
}

Teaching activity

I have mostly taught in Bordeaux during my PhD and during a research and teaching assistant position after my PhD. I have recently gave a lecture on Video Coding and Representation in Italian. Up to now, my teaching activities account for 200+ hours.

  • University of Florence: (In Italian)
    • Lecture Course on Video Coding and Representation (MPEG4-MPEG7), BSc level
  • IUT Bordeaux 1 : (In French)
    • Lecture Course on Video Analysis, BSc level
  • ENSEIRB : (In French)
    • Practicals on Video Indexing, MSc level, option “Multimedia Technologies”
    • Project on Video Analysis, MSc level, option “Multimedia Technologies”
    • Tutorial Class on Work Environment, BSc level
    • Programming Projects, BSc level
  • University of Bordeaux 1 : (In French)
    • Computer Science and Internet Certificate (C2I), BSc level
    • Softwares – Rebound Semester, BSc level
    • Master students’ internships follow-up

International Conferences and Workshops

2015

  • [PDF] A. Ciolini, L. Seidenari, S. Karaman, and A. Del Bimbo, “Efficient Hough Forest Object Detection for Low-power Devices,” in IEEE First International Workshop on Wearable and Ego-vision Systems for Augmented Experience (WEsAX), 2015.
    [Bibtex]
    @inproceedings{ciolini2015,
    author = {Ciolini, Andrea and Seidenari, Lorenzo and Karaman, Svebor and Del Bimbo, Alberto},
    title = {Efficient Hough Forest Object Detection for Low-power Devices},
    booktitle = {IEEE First International Workshop on Wearable and Ego-vision Systems for Augmented Experience (WEsAX)},
    year = {2015}
    }
  • [PDF] F. Bartoli, L. Seidenari, G. Lisanti, S. Karaman, and A. Del Bimbo, “WATSS: a Web Annotation Tool for Surveillance Scenarios,” in ACM Multimedia 2015 Open Source Software Competition, 2015.
    [Bibtex]
    @inproceedings{bartoli2015watss,
    title = {WATSS: a Web Annotation Tool for Surveillance Scenarios},
    author = {Bartoli, Federico and Seidenari, Lorenzo and Lisanti, Giuseppe and Karaman, Svebor and Del Bimbo, Alberto},
    booktitle = {ACM Multimedia 2015 Open Source Software Competition},
    year = {2015}
    }
  • [PDF] F. Bartoli, G. Lisanti, L. Seidenari, S. Karaman, and A. Del Bimbo, “MuseumVisitors: A Dataset for Pedestrian and Group Detection, Gaze Estimation and Behavior Understanding,” in Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, 2015, p. 19–27.
    [Bibtex]
    @inproceedings{bartoli2015museumvisitors,
    title={MuseumVisitors: A Dataset for Pedestrian and Group Detection, Gaze Estimation and Behavior Understanding},
    author={Bartoli, Federico and Lisanti, Giuseppe and Seidenari, Lorenzo and Karaman, Svebor and Del Bimbo, Alberto},
    booktitle={Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops},
    pages={19--27},
    year={2015}
    }

2014

  • [PDF] S. Karaman, L. Seidenari, S. Ma, A. Del Bimbo, and S. Sclaroff, “Adaptive Structured Pooling for Action Recognition,” in Proc. of British Machine Vision Conference (BMVC), Nottingham, UK, 2014.
    [Bibtex]
    @InProceedings{karamanbmvc2014,
    author = "Karaman, Svebor and Seidenari, Lorenzo and Ma, Shugao and Del Bimbo, Alberto and Sclaroff, Stan",
    title = "Adaptive Structured Pooling for Action Recognition",
    booktitle = "Proc. of British Machine Vision Conference (BMVC)",
    address = "Nottingham, UK",
    year = "2014",
    note = "Poster",
    }
  • [PDF] F. Bartoli, G. Lisanti, S. Karaman, A. D. Bagdanov, and A. Del Bimbo, “Unsupervised scene adaptation for faster multi-scale pedestrian detection,” in 22nd International Conference on Pattern Recognition (ICPR), Stockholm, Sweden, 2014.
    [Bibtex]
    @InProceedings{bartoliicpr2014,
    author = {Bartoli, Federico and Lisanti, Giuseppe and Karaman, Svebor and Bagdanov, Andrew D. and Del Bimbo, Alberto},
    title = {Unsupervised scene adaptation for faster multi-scale pedestrian detection},
    note = {Oral presentation},
    booktitle = {22nd International Conference on Pattern Recognition (ICPR)},
    address = {Stockholm, Sweden},
    year = {2014}
    }

2013

  • [PDF] S. Karaman, L. Seidenari, A. D. Bagdanov, and A. Del Bimbo, “L1-regularized Logistic Regression Stacking and CRF Smoothing for Action Recognition,” in THUMOS: ICCV Workshop on Action Recognition with a Large Number of Classes, 2013.
    [Bibtex]
    @InProceedings{karamanthumos2013,
    author = "Karaman, Svebor and Seidenari, Lorenzo and Bagdanov, Andrew D. and Del Bimbo, Alberto",
    title = "L1-regularized Logistic Regression Stacking and CRF Smoothing for Action Recognition",
    booktitle = "THUMOS: ICCV Workshop on Action Recognition with a Large Number of Classes",
    year = "2013",
    note = {Oral presentation. Ranked #2 of the Action Recognition Challenge}
    }
  • [PDF] [DOI] S. Karaman, A. D. Bagdanov, G. D’Amico, L. Landucci, A. Ferracani, D. Pezzatini, and A. Del Bimbo, “Passive Profiling and Natural Interaction Metaphors for Personalized Multimedia Museum Experiences,” in MM4CH’13 – New Trends in Image Analysis and Processing – ICIAP 2013, Naples, Italy: Springer, 2013, p. 247–256.
    [Bibtex]
    @incollection{karaman2013passive,
    title = {Passive Profiling and Natural Interaction Metaphors for Personalized Multimedia Museum Experiences},
    author = {Karaman, Svebor and Bagdanov, Andrew D and D’Amico, Gianpaolo and Landucci, Lea and Ferracani, Andrea and Pezzatini, Daniele and Del Bimbo, Alberto},
    booktitle = {MM4CH'13 - New Trends in Image Analysis and Processing -- ICIAP 2013},
    doi = {10.1007/978-3-642-41190-8_27},
    pages = {247--256},
    address = {Naples, Italy},
    year = {2013},
    note={Oral Presentation},
    publisher = {Springer}
    }
  • [PDF] [DOI] A. D. Bagdanov, A. Del Bimbo, D. Di Fina, S. Karaman, G. Lisanti, and I. Masi, “Multi-Target Data Association using Sparse Reconstruction,” in Proc. of International Conference on Image Analysis and Processing (ICIAP), Naples, Italy, 2013, pp. 239-248.
    [Bibtex]
    @inproceedings{DBLMKD13,
    author = {Bagdanov, Andrew D. and Del Bimbo, Alberto and Di Fina, Dario and Karaman, Svebor and Lisanti, Giuseppe and Masi, Iacopo},
    title = {Multi-Target Data Association using Sparse Reconstruction},
    booktitle = {Proc. of International Conference on Image Analysis and Processing (ICIAP)},
    year = {2013},
    address = {Naples, Italy},
    pages = {239-248},
    note={Poster},
    doi = {10.1007/978-3-642-41184-7_25},
    publisher = {Springer Berlin Heidelberg},
    keywords = {Data association; multi-target tracking; sparse methods; video surveillance},
    url = {http://www.micc.unifi.it/publications/2013/DBLMKD13}
    }

2012

  • [PDF] [DOI] S. Karaman and A. D. Bagdanov, “Identity Inference: Generalizing Person Re-identification Scenarios,” in Computer Vision – ECCV 2012. Workshops and Demonstrations, A. Fusiello, V. Murino, and R. Cucchiara, Eds., Firenze, Italy: Springer Berlin Heidelberg, 2012, vol. 7583, pp. 443-452.
    [Bibtex]
    @incollection{karamanIdInf2012,
    isbn={978-3-642-33862-5},
    booktitle={Computer Vision – ECCV 2012. Workshops and Demonstrations},
    volume={7583},
    series={Lecture Notes in Computer Science},
    editor={Fusiello, Andrea and Murino, Vittorio and Cucchiara, Rita},
    doi={10.1007/978-3-642-33863-2_44},
    title={Identity Inference: Generalizing Person Re-identification Scenarios},
    url={http://dx.doi.org/10.1007/978-3-642-33863-2_44},
    publisher={Springer Berlin Heidelberg},
    author={Karaman, Svebor and Bagdanov, Andrew D.},
    pages={443-452},
    address = {Firenze, Italy},
    note={Oral Presentation. Best Paper Award},
    year={2012}
    }
  • [PDF] J. Pinquier, S. Karaman, L. Letoupin, P. Guyot, R. Megret, J. Benois-Pineau, Y. Gaestel, and J. -F. Dartigues, “Strategies for multiple feature fusion with Hierarchical HMM: Application to activity recognition from wearable audiovisual sensors,” in 21st International Conference on Pattern Recognition (ICPR), Tsukuba, Japan, 2012, pp. 3192-3195.
    [Bibtex]
    @INPROCEEDINGS{Pinquier2012,
    author={Pinquier, J. and Karaman, S. and Letoupin, L. and Guyot, P. and Megret, R. and Benois-Pineau, J. and Gaestel, Y. and Dartigues, J.-F.},
    booktitle={21st International Conference on Pattern Recognition (ICPR)},
    title={Strategies for multiple feature fusion with Hierarchical HMM: Application to activity recognition from wearable audiovisual sensors},
    year={2012},
    month={Nov},
    pages={3192-3195},
    abstract={In this paper, we further develop the research on recognition of activities, in videos recorded with wearable cameras, with Hierarchical Hidden Markov Model classifiers. The visual scenes being of a strong complexity in terms of motion and visual content, good performances have been obtained using multiple visual and audio cues. The adequate fusion of features from physically different description spaces remains an open issue not only for this particular task, but in multiple problems of pattern recognition. A study of optimal fusion strategies in the HMM framework is proposed. We design and exploit early, intermediate and late fusions with emitting states in the H-HMM. The results obtained on a corpus recorded by healthy volunteers and patients in a longitudinal dementia study allow choosing optimal fusion strategies as a function of target activity.},
    keywords={gesture recognition;hidden Markov models;image fusion;video signal processing;H-HMM;activity recognition;description spaces;early fusions;healthy volunteers;hierarchical HMM classifier;hierarchical hidden Markov model classifiers;intermediate fusions;late fusions;longitudinal dementia study;motion content;multiple feature fusion;optimal fusion strategies;pattern recognition;strong complexity;target activity;visual content;visual scenes;wearable audiovisual sensors;wearable cameras;Cameras;Hidden Markov models;Multimedia communication;Pattern recognition;Streaming media;Videos;Visualization},
    url = {http://ieeexplore.ieee.org/xpl/freeabs_all.jsp?arnumber=6460843},
    note={Poster},
    address = {Tsukuba, Japan},
    ISSN={1051-4651}
    }
  • [PDF] [DOI] S. Karaman, J. Benois-Pineau, R. Mégret, and A. Bugeau, “Multi-layer Local Graph Words for Object Recognition,” in Advances in Multimedia Modeling, K. Schoeffmann, B. Merialdo, A. Hauptmann, C. Ngo, Y. Andreopoulos, and C. Breiteneder, Eds., Klagenfurt, Austria: Springer Berlin Heidelberg, 2012, vol. 7131, pp. 29-39.
    [Bibtex]
    @incollection{karamanMMM2012,
    isbn={978-3-642-27354-4},
    booktitle={Advances in Multimedia Modeling},
    volume={7131},
    series={Lecture Notes in Computer Science},
    editor={Schoeffmann, Klaus and Merialdo, Bernard and Hauptmann, AlexanderG. and Ngo, Chong-Wah and Andreopoulos, Yiannis and Breiteneder, Christian},
    doi={10.1007/978-3-642-27355-1_6},
    title={Multi-layer Local Graph Words for Object Recognition},
    url={http://dx.doi.org/10.1007/978-3-642-27355-1_6},
    publisher={Springer Berlin Heidelberg},
    keywords={Feature representation; Structural features; Bag-of-Visual-Words; Graph Words; Delaunay triangulation; Context Dependent Kernel},
    author={Karaman, Svebor and Benois-Pineau, Jenny and Mégret, Rémi and Bugeau, Aurélie},
    note={Oral Presentation},
    address = {Klagenfurt, Austria},
    pages={29-39},
    year={2012}
    }

2011

  • [PDF] [DOI] S. Karaman, J. Benois-Pineau, R. Mégret, J. Pinquier, Y. Gaestel, and J. -F. Dartigues, “Activities of daily living indexing by hierarchical HMM for dementia diagnostics,” in 9th International Workshop on Content-Based Multimedia Indexing (CBMI), Madrid, Spain, 2011, pp. 79-84.
    [Bibtex]
    @INPROCEEDINGS{karamanCBMI2011,
    author={Karaman, S. and Benois-Pineau, J. and Mégret, R. and Pinquier, J. and Gaestel, Y. and Dartigues, J.-F.},
    booktitle={9th International Workshop on Content-Based Multimedia Indexing (CBMI)},
    title={Activities of daily living indexing by hierarchical HMM for dementia diagnostics},
    year={2011},
    month={June},
    address = {Madrid, Spain},
    pages={79-84},
    abstract={This paper presents a method for indexing human activities in videos captured from a wearable camera being worn by patients, for studies of progression of the dementia diseases. Our method aims to produce indexes to facilitate the navigation throughout the individual video recordings, which could help doctors search for early signs of the disease in the activities of daily living. The recorded videos have strong motion and sharp lighting changes, inducing noise for the analysis. The proposed approach is based on a two steps analysis. First, we propose a new approach to segment this type of video, based on apparent motion. Each segment is characterized by two original motion descriptors, as well as color, and audio descriptors. Second, a Hidden-Markov Model formulation is used to merge the multimodal audio and video features, and classify the test segments. Experiments show the good properties of the approach on real data.},
    keywords={hidden Markov models;image colour analysis;image segmentation;indexing;medical diagnostic computing;medical disorders;video recording;audio descriptors;color descriptors;daily living indexing;dementia diagnostics;dementia diseases;hidden-Markov model formulation;hierarchical HMM;human activities indexing;multimodal audio features;original motion descriptors;recorded videos;test segments;two steps analysis;video features;video recordings;wearable camera;Accuracy;Cameras;Dynamics;Hidden Markov models;Histograms;Motion segmentation;Videos},
    doi={10.1109/CBMI.2011.5972524},
    note={Oral Presentation},
    ISSN={1949-3983}
    }
  • [PDF] Y. Gaëstel, S. Karaman, R. Megret, O. Cherifa, T. Francoise, B. Jenny, and J. Dartigues, “Autonomy at home and early diagnosis in Alzheimer’s Disease: Utility of video indexing applied to clinical issues, the IMMED project,” in Alzheimer’s Association International Conference on Alzheimer’s Disease (AAICAD), Paris, France, 2011, p. S245.
    [Bibtex]
    @inproceedings{gaestel2011,
    hal_id = {hal-00978228},
    url = {http://hal.archives-ouvertes.fr/hal-00978228},
    title = {Autonomy at home and early diagnosis in Alzheimer's Disease: Utility of video indexing applied to clinical issues, the IMMED project},
    author = {Ga{\"e}stel, Yann and Karaman, Svebor and Megret, R{\'e}mi and Cherifa, Onifade-Fagbe and Francoise, Trophy and Jenny, Benois-Pineau and Dartigues, Jean-Fran{\c c}ois},
    abstract = {With ageing of the population in the world, patients with Alzheimer's disease (AD) consequently increase. People suffering from this pathology show early modifications in their "activities of daily living". Those abilities modifications are part of the dementia diagnosis, but are often not reported by the patients or their families. Being able to capture these early signs of autonomy loss could be a way to diagnose earlier dementia and to prevent insecurity at home. We first developed a wearable camera (shoulder mounted) to capture people's activity at home in a non-invasive manner. We then developed a video-indexing methodology to help physicians explore their patients' home-recorded video. This video indexing system requires video and audio analyses to automatically identify and index activities of interest where insecurity or risks could be highlightened. Patients are recruited among the Bagatelle (Talence, France) Memory clinic department patients and are suffering from mild cognitive impairments or very mild AD. We met ten patients at home and we recorded one hour of daily activities for each. The data (video and questionnaires: Activities of Daily Living/Instrumental Activities of Daily Living) are now collected on an extended sample of people suffering from mild cognitive impairments and from very mild AD. We aimed at evaluating behavioral modifications and ability loss detection by comparing the subjects' self reported questionnaires and the video analyses. This project is a successful collaboration between various fields of research. Here, technology is developed to be helpful in everyday challenges that people suffering from dementia of the Alzheimer type are faced with. The automation of the video indexing could be a great step forward in video analysis if it could reduce the time needed to embrace the patient's lifestream, helping in early diagnosis of dementia and becoming a very useful tool to keep individuals safe at home. In fact, many goals could be reached with such video analyses: an early diagnosis of dementia of the Alzheimer type, avoiding danger in home living and evaluating the progression of the disease or the effects of the various therapies (drug-therapy and others).},
    language = {Anglais},
    affiliation = {Institut de Sant{\'e} Publique, d'Epid{\'e}miologie et de D{\'e}veloppement - ISPED , Laboratoire Bordelais de Recherche en Informatique - LaBRI , Laboratoire de l'int{\'e}gration, du mat{\'e}riau au syst{\`e}me - IMS , MSPB Bagatelle - MSPB , Epid{\'e}miologie et Biostatistique},
    booktitle = {{Alzheimer's Association International Conference on Alzheimer's Disease (AAICAD)}},
    pages = {S245},
    address = {Paris, France},
    editor = {Alzheimer's \& Dementia: The Journal of the Alzheimer's Association },
    audience = {internationale },
    note = {Poster presentation. Abstract published in Journal of Alzheimer's and Dementia, volume 7 (4), pp. S245, July 2011},
    collaboration = {IMMED },
    year = {2011},
    month = {Jul}
    }

2010

  • [PDF] [DOI] S. Karaman, J. Benois-Pineau, R. Mégret, V. Dovgalecs, J. -F. Dartigues, and Y. Gaëstel, “Human Daily Activities Indexing in Videos from Wearable Cameras for Monitoring of Patients with Dementia Diseases,” in 20th International Conference on Pattern Recognition (ICPR), Istanbul, Turkey, 2010, pp. 4113-4116.
    [Bibtex]
    @INPROCEEDINGS{karamanICPR2010,
    author={Karaman, S. and Benois-Pineau, J. and Mégret, R. and Dovgalecs, V. and Dartigues, J.-F. and Gaëstel, Y.},
    booktitle={20th International Conference on Pattern Recognition (ICPR)},
    title={Human Daily Activities Indexing in Videos from Wearable Cameras for Monitoring of Patients with Dementia Diseases},
    year={2010},
    month={Aug},
    pages={4113-4116},
    abstract={Our research focuses on analysing human activities according to a known behaviorist scenario, in case of noisy and high dimensional collected data. The data come from the monitoring of patients with dementia diseases by wearable cameras. We define a structural model of video recordings based on a Hidden Markov Model. New spatio-temporal features, color features and localization features are proposed as observations. First results in recognition of activities are promising.},
    keywords={feature extraction;hidden Markov models;image colour analysis;image motion analysis;video cameras;video recording;video signal processing;activity recognition;behaviorist scenario;color features;dementia disease patients;hidden Markov model;human activity indexing;localization features;patient monitoring;spatiotemporal features;video recordings;wearable cameras;Biomedical monitoring;Cameras;Hidden Markov models;Histograms;Image color analysis;Motion segmentation;Videos;Bag of Features;HMM;Localization;Monitoring;Video Indexing},
    doi={10.1109/ICPR.2010.999},
    note={Oral Presentation},
    ISSN={1051-4651},
    address={Istanbul, Turkey}
    }
  • [PDF] [DOI] R. Mégret, V. Dovgalecs, H. Wannous, S. Karaman, J. Benois-Pineau, E. El Khoury, J. Pinquier, P. Joly, R. André-Obrecht, Y. Gaëstel, and J. Dartigues, “The IMMED Project: Wearable Video Monitoring of People with Age Dementia,” in Proceedings of the International Conference on Multimedia (ACMMM), Firenze, Italy, 2010, p. 1299–1302.
    [Bibtex]
    @inproceedings{Megret2010,
    author = {M{\'e}gret, R{\'e}mi and Dovgalecs, Vladislavs and Wannous, Hazem and Karaman, Svebor and Benois-Pineau, Jenny and El Khoury, Elie and Pinquier, Julien and Joly, Philippe and Andr{\'e}-Obrecht, R{\'e}gine and Ga\"{e}stel, Yann and Dartigues, Jean-Fran\c{c}ois},
    title = {The IMMED Project: Wearable Video Monitoring of People with Age Dementia},
    booktitle = {Proceedings of the International Conference on Multimedia (ACMMM)},
    series = {MM '10},
    year = {2010},
    isbn = {978-1-60558-933-6},
    address = {Firenze, Italy},
    pages = {1299--1302},
    numpages = {4},
    url = {http://doi.acm.org/10.1145/1873951.1874206},
    doi = {10.1145/1873951.1874206},
    acmid = {1874206},
    note = {Video program},
    publisher = {ACM},
    keywords = {audio and video indexing, patient monitoring, wearable camera}
    }