To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ?????ぜ臾ょ?柔ロ??ル?艤???λ?B 001111110011111100111111001111110011111110000010101110101110010001101011100000101110010100111111100011110101111110000011100011010011111100111111100000111000101100111111111001000111111000111111001111110011111110000011110010010011111101000010 3f3f3f3f3f82bae46b82e53f8f5f838d3f3f838b3fe47e3f3f3f83c93f42
EUC-JP ???沅?ぜ臾ょ?柔ロ??ルł艤???λ?B 00111111001111110011111110001111110001101110100100111111101001001011110011100111110011001010010011100111001111111011110111000000101001011110110100111111001111111010010111101011100011111010100111001000111001111101111100111111001111110011111110100110110010110011111101000010 3f3f3f8fc6e93fa4bce7cca4e73fbdc0a5ed3f3fa5eb8fa9c8e7df3f3f3fa6cb3f42
UTF-8 嶺뚮뿭沅좄ぜ臾ょ춯柔ロ닏曆ルł艤띹눧類λ♧B 1110111110100110101010111110101110011010101011101110101110111111101011011110011010110010100001011110110010100010100001001110001110000001100111001110100010000111101111101110001110000010100001111110110010110110101011111110011010011111100101001110001110000011101011011110101110001011100011111110111110100110100010111110001110000011101010111100010110000010111010001000100110100100111010111001110110111001111010111000100010100111111011111010011110010000110011101011101111100010100110011010011101000010 efa6abeb9aaeebbfade6b285eca284e3819ce887bee38287ecb6afe69f94e383adeb8b8fefa68be383abc582e889a4eb9db9eb88a7efa790cebbe299a742
UHC 嶺뚮뿭沅좄ぜ臾ょ춯柔ロ닏曆ルł艤띹눧類λ♧B 11100111101011011000110011101011100101111010110111101010101101101010000011101000101010101011110011101011101011001010101011100111101011011000110011101010111101011010101111101101100010001001010111100110101101111010101111101011101010011010100111101011111110101000110111101000100001111011111011101011101110101010010111101011101000101011111101000010 e7ad8ceb97adeab6a0e8aabcebacaae7ad8ceaf5abed8895e6b7abeba9a9ebfa8de887beebbaa5eba2bf42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)