To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 塋ゆ?熬?オ蘊??渦 10011010110010001000001011100100001111111110000010010010001111111000001101001001111001010101110100111111001111111000100101010001 9ac882e43fe0923f8349e55d3f3f8951
EUC-JP 塋ゆ?熬?オ蘊??渦 11010100110010101010010011100110001111111101111111110010001111111010010110101010111010011011111000111111001111111011000110110010 d4caa4e63fdff23fa5aae9be3f3fb1b2
UTF-8 塋ゆ짎熬뽫オ蘊딀뜆渦 111001011010000110001011111000111000001010000110111011001010011110001110111001111000011010101100111010111011110110101011111000111000001010101010111010001001100010001010111010111001010010000000111010111001110010000110111001101011100010100110 e5a18be38286eca78ee786acebbdabe382aae8988aeb9480eb9c86e6b8a6
UHC 塋ゆ짎熬뽫オ蘊딀뜆渦 1110011110101011101010101110011010100011100110101110100010100010100101101110011110101011101010101110100010110011100010101110011010001101100010011110100010111110 e7abaae6a39ae8a296e7abaae8b38ae68d89e8be

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)