To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????B 00111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f42
SJIS-WIN 形ゆ?形ゆ?B 1000110001100000100000101110010000111111100011000110000010000010111001000011111101000010 8c6082e43f8c6082e43f42
EUC-JP 形ゆ?形ゆ?B 1011011111000001101001001110011000111111101101111100000110100100111001100011111101000010 b7c1a4e63fb7c1a4e63f42
UTF-8 形ゆ꼇形ゆ꼇B 11100101101111011010001011100011100000101000011011101010101111001000011111100101101111011010001011100011100000101000011011101010101111001000011101000010 e5bda2e38286eabc87e5bda2e38286eabc8742
UHC 形ゆ꼇形ゆ꼇B 11111011101000011010101011100110101100101011101111111011101000011010101011100110101100101011101101000010 fba1aae6b2bbfba1aae6b2bb42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)