I read a flat file (for example a .csv file with one line per User, e.g. UserId;Data1;Date2).
But how do I handle duplicate User items in the reader (there is no list of previously read users...)?
stepBuilderFactory.get("createUserStep1")
        .<User, User>chunk(1000)
        .reader(flatFileItemReader) // FlatFileItemReader
        .writer(itemWriter)         // for example, a JDBC writer
        .build();
When a job that fails is to be recovered simply by re-running it, the tasklet model can be chosen to keep recovery simple. In the chunk model, recovery must be handled either by restoring the processed data to its state before the job ran, or by creating a job that processes only the unprocessed data.
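As an illustration, here is a minimal sketch of a tasklet-model step; the class name and the work done inside execute() are assumptions, not part of the original question:

import org.springframework.batch.core.StepContribution;
import org.springframework.batch.core.scope.context.ChunkContext;
import org.springframework.batch.core.step.tasklet.Tasklet;
import org.springframework.batch.repeat.RepeatStatus;

// Hypothetical tasklet: all work happens in a single execute() call,
// so a failed run can be recovered by simply re-running the job.
public class CreateUsersTasklet implements Tasklet {

    @Override
    public RepeatStatus execute(StepContribution contribution, ChunkContext chunkContext) throws Exception {
        // read the whole file and write all users here (assumed logic)
        return RepeatStatus.FINISHED;
    }
}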
Spring Batch parallel processing runs each chunk in its own thread once you add a task executor to the step. If there are a million records to process, each chunk holds 1,000 records, and the task executor exposes four threads, you can handle 4,000 records in parallel instead of 1,000.
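For example, a minimal sketch of a multi-threaded step based on the question's configuration (note that FlatFileItemReader is stateful and not thread-safe, so it would need to be synchronized or replaced for real parallel use):

stepBuilderFactory.get("createUserStep1")
        .<User, User>chunk(1000)
        .reader(flatFileItemReader)
        .writer(itemWriter)
        .taskExecutor(new SimpleAsyncTaskExecutor()) // each chunk runs in its own thread
        .throttleLimit(4)                            // at most four concurrent chunks
        .build();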
An ItemReader reads data into the Spring Batch application from a particular source, whereas an ItemWriter writes data from the Spring Batch application to a particular destination. An ItemProcessor is a class containing the processing code applied to each item after it has been read and before it is written.
An ExecutionContext is a set of key-value pairs containing information that is scoped to either a StepExecution or a JobExecution. Spring Batch persists the ExecutionContext, which helps in cases where you want to restart a batch run (e.g., when a fatal error has occurred).
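As a sketch (the key name "seen.users" and the SeenUsersStream class are assumptions for illustration), state can be saved to and restored from the ExecutionContext via the ItemStream callbacks:

import java.util.HashSet;
import java.util.Set;
import org.springframework.batch.item.ExecutionContext;
import org.springframework.batch.item.ItemStream;
import org.springframework.batch.item.ItemStreamException;

// Hypothetical component whose state survives a restart: the set of seen user ids
// is written to the step's ExecutionContext at each commit and reloaded on restart.
public class SeenUsersStream implements ItemStream {

    private Set<String> seenUserIds = new HashSet<>();

    @Override
    @SuppressWarnings("unchecked")
    public void open(ExecutionContext executionContext) throws ItemStreamException {
        if (executionContext.containsKey("seen.users")) {
            seenUserIds = (Set<String>) executionContext.get("seen.users"); // restore on restart
        }
    }

    @Override
    public void update(ExecutionContext executionContext) throws ItemStreamException {
        executionContext.put("seen.users", new HashSet<>(seenUserIds)); // persisted at each chunk commit
    }

    @Override
    public void close() throws ItemStreamException {
    }
}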
Filtering is typically done with an ItemProcessor. If the ItemProcessor returns null, the item is filtered and not passed to the ItemWriter. Otherwise, it is. In your case, you could keep a list of previously seen users in the ItemProcessor. If the user hasn't been seen before, pass it on. If it has been seen before, return null. You can read more about filtering with an ItemProcessor in the documentation here: http://docs.spring.io/spring-batch/trunk/reference/html/readersAndWriters.html#filiteringRecords
import java.util.HashSet;
import java.util.Set;
import org.springframework.batch.item.ItemProcessor;

/**
 * This implementation assumes that there is enough room in memory to store the duplicate
 * Users. Otherwise, you'd want to store them somewhere you can do a look-up on.
 */
public class UserFilterItemProcessor implements ItemProcessor<User, User> {

    // This assumes that User.equals() and User.hashCode() identify the duplicates
    private Set<User> seenUsers = new HashSet<User>();

    public User process(User user) {
        // Returning null filters the item: it is not passed on to the ItemWriter
        if (seenUsers.contains(user)) {
            return null;
        }
        seenUsers.add(user);
        return user;
    }
}
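To apply it, the processor just needs to be wired into the step from the question, for example:

stepBuilderFactory.get("createUserStep1")
        .<User, User>chunk(1000)
        .reader(flatFileItemReader)
        .processor(new UserFilterItemProcessor())
        .writer(itemWriter)
        .build();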