To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ??????^ | 00111111001111110011111100111111001111110011111101011110 | 3f3f3f3f3f3f5e |
| SJIS-WIN | 襪ョ桑襪ョ桑^ | 11100110010000111000001110000111100011000100101111100110010000111000001110000111100011000100101101011110 | e64383878c4be64383878c4b5e |
| EUC-JP | 襪ョ桑襪ョ桑^ | 11101011101001001010010111100111101101111010110011101011101001001010010111100111101101111010110001011110 | eba4a5e7b7aceba4a5e7b7ac5e |
| UTF-8 | 襪ョ桑襪ョ桑^ | 11101000101001011010101011100011100000111010011111100110101000011001000111101000101001011010101011100011100000111010011111100110101000011001000101011110 | e8a5aae383a7e6a191e8a5aae383a7e6a1915e |
| UHC | 襪ョ桑襪ョ桑^ | 11011000110010101010101111100111110111111100110111011000110010101010101111100111110111111100110101011110 | d8caabe7dfcdd8caabe7dfcd5e |

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
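
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's actual implementation. The mapping from the page's charset labels to Python codec names (`CODECS`) and the helper name `to_bit_strings` are assumptions made for illustration, as is the choice to replace characters a charset cannot represent with "?" (which matches the ISO-8859-1 row).

```python
# Assumed mapping from the page's charset labels to Python codec names.
# These pairings are illustrative; the page's own converter may differ.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with "?".
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per encoded byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "襪ョ桑襪ョ桑^"
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte of the encoded result becomes eight binary digits or two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.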