To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ??ゅ?訥ぢ??び 00111111001111111000001011100011001111111110011001100011100000101100000000111111001111111000001011010001 3f3f82e33fe66382c03f3f82d1
EUC-JP ??ゅ?訥ぢ??び 00111111001111111010010011100101001111111110101111000100101001001100001000111111001111111010010011010011 3f3fa4e53febc4a4c23f3fa4d3
UTF-8 룵퓦ゅ룶訥ぢ룶죴び 111010111010001110110101111011011001001110100110111000111000001010000101111010111010001110110110111010001010100010100101111000111000000110100010111010111010001110110110111011001010001110110100111000111000000110110011 eba3b5ed93a6e38285eba3b6e8a8a5e381a2eba3b6eca3b4e381b3
UHC 룵퓦ゅ룶訥ぢ룶죴び 100011111010101010111111100011111010101011100101100011111010101111010010111011011010101011000010100011111010101110100001100011111010101011010011 8faabf8faae58fabd2edaac28faba18faad3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)