Bottom-up approach to collecting data: blessing and curse for a large linguistic resourceдоклад на конференции