What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?艶?埃?????①佃ゆ暄т佃???訥?? 001111111000100110010000001111111001101010111010001111110011111100111111001111110011111110000111010000001001001011001111100000101110010010011101111101011000010010000100100100101100111100111111001111110011111111100110011000110011111100111111 3f89903f9aba3f3f3f3f3f874092cf82e49df5848492cf3f3f3fe6633f3f
EUC-JP ?艶?埃??????佃ゆ暄т佃???訥?? 0011111110110001111100000011111111010100101111000011111100111111001111110011111100111111001111111100010011010001101001001110011011011010111101111010011111100100110001001101000100111111001111110011111111101011110001000011111100111111 3fb1f03fd4bc3f3f3f3f3f3fc4d1a4e6daf7a7e4c4d13f3f3febc43f3f
UTF-8 쒀艶쒀埃렔렶쒔롍뤏①佃ゆ暄т佃찊종춲訥엥玲 1110110010010010100000001110100010001001101101101110110010010010100000001110010110011111100000111110101110100000100101001110101110100000101101101110110010010010100101001110101110100001100011011110101110100100100011111110001010010001101000001110010010111101100000111110001110000010100001101110011010011010100001001101000110000010111001001011110110000011111011001011000010001010111011001010001010000101111011001011011010110010111010001010100010100101111011001001011110100101111011111010011010101101 ec9280e889b6ec9280e59f83eba094eba0b6ec9294eba18deba48fe291a0e4bd83e38286e69a84d182e4bd83ecb08aeca285ecb6b2e8a8a5ec97a5efa6ad
UHC 쒀艶쒀埃렔렶쒔롍뤏①佃ゆ暄т佃찊종춲訥엥玲 101111101010110011100110111111011011111010101100111001001110111110001110101010011000111011000001101111101010110110001110110100111000111110111111101010001110011111101110111011001010101011100110111111011011111010101100111001001110111011101100101010011000111011000001101111101010110110001110110100101110110110111111101010001110011110111111 beace6fdbeace4ef8ea98ec1bead8ed38fbfa8e7eeecaae6fdbeace4eeeca98ec1bead8ed2edbfa8e7bf

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
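
A similar conversion can be sketched in Python. This is only an illustration, not the code behind this page: the helper name `encode_row` and the mapping from the charset labels to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) are assumptions. Characters that a charset cannot represent are replaced with "?" (0x3f), which is why runs of 3f appear in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Hypothetical helper: characters the codec cannot represent are
    replaced with "?" (byte 0x3f) instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "あ"  # HIRAGANA LETTER A, used only as an example input
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, sample, bits, hex_string)
```

Each line printed follows the same column order as the table: charset, character, binary bit string, hexadecimal bit string.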