Remove objects with a duplicate property from List

Question

I have a List of objects in C#. All of the objects contain a property ID. There are several objects that have the same ID property.

How can I trim the List (or make a new List) where there is only one object per ID property?

[Any additional duplicates are dropped out of the List]

Daniel Lord · Accepted Answer · 2020-09-29 09:39:24Z

230

If you want to avoid using a third-party library, you could do something like:

var bar = fooArray.GroupBy(x => x.Id).Select(x => x.First()).ToList();

That will group the array by the Id property, then select the first entry in the grouping.
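To make the "first entry wins" behavior concrete, here is a minimal sketch. The sample data is hypothetical: named tuples stand in for whatever class actually carries the Id property.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical sample data; tuples stand in for objects with an Id property.
var fooArray = new[]
{
    (Id: 1, Name: "first A"),
    (Id: 2, Name: "first B"),
    (Id: 1, Name: "second A"),
};

// One group per Id; First() keeps the earliest element of each group.
var bar = fooArray.GroupBy(x => x.Id).Select(g => g.First()).ToList();

foreach (var item in bar)
    Console.WriteLine($"{item.Id}: {item.Name}");
// prints:
// 1: first A
// 2: first B
```

For in-memory sequences, GroupBy yields groups in the order their keys first appear, so the result keeps the original ordering of the surviving elements.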

edited Sep 29, 2020 at 9:39

Daniel Lord


answered Apr 3, 2012 at 12:25

Daniel Mann


11

This worked perfectly. Here is my implementation: List<InputRow> uniqueRows = inputRows.GroupBy(x => x.Id).Select(x => x.First()).ToList<InputRow>();
– Baxter
Commented Apr 3, 2012 at 13:24
7

Glad to help! One note: The <InputRow> on your ToList() is redundant. You should be able to just do .ToList()
– Daniel Mann
Commented Apr 3, 2012 at 13:43
1

You are right, it works with just ToList() instead of ToList<InputRow>().
– Baxter
Commented Apr 3, 2012 at 16:20
A good alternative to trying to figure out why Distinct and IEquatable are not working.
– Ariwibawa
Commented Nov 19, 2020 at 4:55


Kolappan N · Accepted Answer · 2018-11-30 14:26:44Z

38

MoreLINQ's DistinctBy() will do the job; it allows using an object property to determine distinctness. Unfortunately, the built-in LINQ Distinct() is not flexible enough.

var uniqueItems = allItems.DistinctBy(i => i.Id);

DistinctBy()

Returns all distinct elements of the given source, where "distinctness" is determined via a projection and the default equality comparer for the projected type.

PS: Credits to Jon Skeet for sharing this library with the community.

edited Nov 30, 2018 at 14:26

Kolappan N


answered Apr 3, 2012 at 12:26

sll


1

I think this is a great solution but am trying to avoid using a 3rd party library for this. Thank You.
– Baxter
Commented Apr 3, 2012 at 13:26
3

Fortunately you can see how it is implemented
– sll
Commented Apr 3, 2012 at 13:40


Theodor Zoulias · Accepted Answer · 2023-01-18 14:18:03Z

Starting from .NET 6, a new DistinctBy LINQ operator is available:

public static IEnumerable<TSource> DistinctBy<TSource,TKey> (
    this IEnumerable<TSource> source,
    Func<TSource,TKey> keySelector);

Returns distinct elements from a sequence according to a specified key selector function.

Usage example:

List<Item> distinctList = listWithDuplicates
    .DistinctBy(i => i.Id)
    .ToList();

There is also an overload that has an IEqualityComparer<TKey> parameter.
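As an illustration of the comparer overload, the following sketch treats IDs that differ only by letter case as duplicates. The sample data and the choice of StringComparer.OrdinalIgnoreCase are assumptions made for the example; it needs .NET 6 or later.

```csharp
using System;
using System.Collections.Generic;
using System.Linq;

// Hypothetical sample data with string IDs that differ only by case.
var items = new[]
{
    (Id: "abc", Value: 1),
    (Id: "ABC", Value: 2),
    (Id: "xyz", Value: 3),
};

// The second argument is an IEqualityComparer<TKey> applied to the keys.
var distinctList = items
    .DistinctBy(i => i.Id, StringComparer.OrdinalIgnoreCase)
    .ToList();

foreach (var item in distinctList)
    Console.WriteLine($"{item.Id}: {item.Value}");
// prints:
// abc: 1
// xyz: 3
```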

Update in-place: In case creating a new List<T> is not desirable, here is a RemoveDuplicates extension method for the List<T> class:

/// <summary>
/// Removes all the elements that are duplicates of previous elements,
/// according to a specified key selector function.
/// </summary>
/// <returns>
/// The number of elements removed.
/// </returns>
public static int RemoveDuplicates<TSource, TKey>(
    this List<TSource> source,
    Func<TSource, TKey> keySelector,
    IEqualityComparer<TKey> keyComparer = null)
{
    ArgumentNullException.ThrowIfNull(source);
    ArgumentNullException.ThrowIfNull(keySelector);
    HashSet<TKey> hashSet = new(keyComparer);
    return source.RemoveAll(item => !hashSet.Add(keySelector(item)));
}

This method is efficient (O(n)) but also a bit dangerous, because it is based on the potentially corruptive List<T>.RemoveAll method¹. If the keySelector lambda succeeds for some elements and then fails for another element, the partially modified List<T> will neither be restored to its initial state, nor will it be in a state recognizable as the result of successful individual Removes. Instead it will transition to a corrupted state that includes duplicate occurrences of existing elements. So if the keySelector lambda is not fail-proof, the RemoveDuplicates method should be invoked in a try block that has a catch block where the potentially corrupted list is discarded.

Alternatively you could substitute the dangerous built-in RemoveAll with a safe custom implementation, that offers predictable behavior.

¹ For all .NET versions and platforms, including the latest .NET 7. I have submitted a proposal on GitHub to document the corruptive behavior of the List<T>.RemoveAll method, and the feedback that I received was that neither the behavior should be documented, nor the implementation should be fixed.
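One possible shape for such a safe alternative is sketched below. It is not the custom implementation the answer refers to; RemoveDuplicatesSafe is a hypothetical helper (written as a local function so the snippet runs as a single file) that evaluates keySelector for every element before touching the list. If the lambda throws, the list is still in its original state. The trade-off is a temporary second list, so it uses O(n) extra memory.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical helper, not a framework API.
static int RemoveDuplicatesSafe<TSource, TKey>(
    List<TSource> source,
    Func<TSource, TKey> keySelector,
    IEqualityComparer<TKey> keyComparer = null)
{
    ArgumentNullException.ThrowIfNull(source);
    ArgumentNullException.ThrowIfNull(keySelector);

    var seen = new HashSet<TKey>(keyComparer);
    var survivors = new List<TSource>(source.Count);

    // Phase 1: keySelector may throw here, but the list is still untouched.
    foreach (var item in source)
    {
        if (seen.Add(keySelector(item)))
            survivors.Add(item);
    }

    // Phase 2: no user code runs from here on; refill the same list instance.
    int removed = source.Count - survivors.Count;
    if (removed > 0)
    {
        source.Clear();
        source.AddRange(survivors);
    }
    return removed;
}

// Usage with hypothetical sample data.
var list = new List<(int Id, string Name)> { (1, "a"), (2, "b"), (1, "c") };
int removedCount = RemoveDuplicatesSafe(list, x => x.Id);
Console.WriteLine($"{removedCount} removed, {list.Count} left");
// prints: 1 removed, 2 left
```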

Kolappan N · Accepted Answer · 2018-11-30 10:22:50Z

var list = GetListFromSomeWhere();
var list2 = GetListFromSomeWhere();
list.AddRange(list2);

....
...
var distinctedList = list.DistinctBy(x => x.ID).ToList();

More LINQ at GitHub

Or, if you don't want to use external DLLs for some reason, you can use this Distinct overload:

public static IEnumerable<TSource> Distinct<TSource>(
    this IEnumerable<TSource> source, IEqualityComparer<TSource> comparer)

Usage:

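public class FooComparer : IEqualityComparer<Foo>
{
    // Two Foo objects are considered equal if their IDs are equal.
    public bool Equals(Foo x, Foo y)
    {
        // Check whether the compared objects reference the same data.
        if (Object.ReferenceEquals(x, y)) return true;

        // Check whether either of the compared objects is null.
        if (Object.ReferenceEquals(x, null) || Object.ReferenceEquals(y, null))
            return false;

        return x.ID == y.ID;
    }

    // Required by IEqualityComparer<Foo>. Distinct() compares hash codes
    // first, so objects with equal IDs must return equal hash codes.
    // (Assumes ID itself is never null, e.g. an int.)
    public int GetHashCode(Foo obj)
    {
        return obj is null ? 0 : obj.ID.GetHashCode();
    }
}

// Distinct() returns a new sequence; it does not modify the original list.
var distinctList = list.Distinct(new FooComparer()).ToList();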

Nikita Popov · Accepted Answer · 2020-07-28 15:29:52Z

4

Not sure if anyone is still looking for any additional ways to do this. But I've used this code to remove duplicates from a list of User objects based on matching ID numbers.

private ArrayList RemoveSearchDuplicates(ArrayList SearchResults)
{
    ArrayList TempList = new ArrayList();

    foreach (User u1 in SearchResults)
    {
        bool duplicatefound = false;
        foreach (User u2 in TempList)
            if (u1.ID == u2.ID)
                duplicatefound = true;

        if (!duplicatefound)
            TempList.Add(u1);
    }
    return TempList;
}

Call: SearchResults = RemoveSearchDuplicates(SearchResults);

edited Jul 28, 2020 at 15:29

Nikita Popov


answered Apr 9, 2013 at 22:52

JScott


2

This is pointlessly O(n^2) when regular GroupBy is just O(n)...
– Alexei Levenkov
Commented Nov 17, 2020 at 21:42
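For comparison, a loop-based version can also run in roughly O(n) by tracking the IDs already seen in a HashSet instead of rescanning the output list. This is only a sketch under assumptions: the tuples below are hypothetical stand-ins for the User objects, and a generic List<T> replaces the ArrayList.

```csharp
using System;
using System.Collections.Generic;

// Hypothetical sample data; tuples stand in for User objects with an ID.
var searchResults = new List<(int ID, string Name)>
{
    (7, "Ann"), (8, "Bob"), (7, "Ann again"),
};

var seenIds = new HashSet<int>();
var uniqueResults = new List<(int ID, string Name)>();

foreach (var user in searchResults)
{
    // HashSet<T>.Add returns false when the ID was already present.
    if (seenIds.Add(user.ID))
        uniqueResults.Add(user);
}

Console.WriteLine(uniqueResults.Count);
// prints: 2
```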

