What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????M?????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111101001101001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f4d3f3f3f3f3f3f
SJIS-WIN 筌??認??腰???M筌?∥二?? 1110001010100011001111110011111110010100010001100011111100111111100011011001100000111111001111110011111101001101111000101010001100111111100000010110000110010011111100010011111100111111 e2a33f3f94463f3f8d983f3f3f4de2a33f816193f13f3f
EUC-JP 筌??認??腰???M筌?‖二?? 1110010010100101001111110011111111000111101001110011111100111111101110011111100000111111001111110011111101001101111001001010010100111111101000011100001011000110111100110011111100111111 e4a53f3fc7a73f3fb9f83f3f3f4de4a53fa1c2c6f33f3f
UTF-8 筌뚮뱷認뗥끀腰민띔땋M筌뗫∥二꿩에 11100111101011011000110011101011100110101010111011101011101100011011011111101000101010101000110111101011100101111010010111101011100000011000000011101000100001011011000011101011101011111011110011101011100111011001010011101011100101011000101101001101111001111010110110001100111010111001011110101011111000101000100010100101111001001011101010001100111010101011111110101001111011001001011110010000 e7ad8ceb9aaeebb1b7e8aa8deb97a5eb8180e885b0ebafbceb9d94eb958b4de7ad8ceb97abe288a5e4ba8ceabfa9ec9790
UHC 筌뚮뱷認뗥끀腰민띔땋M筌뗫∥二꿩에 111011111010011110001100111010111001001110011101111011001110001110001011111001011000010110110110111010011010011010111001110011101011011011101010101101101010011001001101111011111010011110001011111010111010000110101011111011001010001110110010111001101011111110100001 efa78ceb939dece38be585b6e9a6b9ceb6eab6a64defa78beba1abeca3b2e6bfa1

SJIS-Win, EUC-JP: Legacy Japanese character encodings, used mainly on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
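
The binary and hexadecimal columns of the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. It assumes that Python's `cp932` codec corresponds to SJIS-WIN and `cp949` to UHC, and that unencodable characters are replaced with "?" (0x3f), which is what the ISO-8859-1 row suggests. The names `CHARSETS` and `encode_table` are hypothetical, and the "Character" column is not reproduced.

```python
# Sketch: encode a string in several charsets and show the resulting bytes
# as a bit string and as hexadecimal.
# Assumption: these Python codec names match the charsets in the table.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-WIN", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return a list of (label, bit string, hex string) for each charset."""
    rows = []
    for label, codec in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexa in encode_table("M"):
        print(label, bits, hexa)
```

For the input "M", every charset listed yields the single byte 0x4d (01001101), because all five are ASCII-compatible for that character. Differences only appear for non-ASCII input.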