To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???U}???U{^ 0011111100111111001111110101010101111101001111110011111100111111010101010111101101011110 3f3f3f557d3f3f3f557b5e
SJIS-WIN 萬緒愁U}萬緒愁U{^ 1110010011011101100011111000111110001111010001000101010101111101111001001101110110001111100011111000111101000100010101010111101101011110 e4dd8f8f8f44557de4dd8f8f8f44557b5e
EUC-JP 萬緒愁U}萬緒愁U{^ 1110100011011111101111011110111110111101101001010101010101111101111010001101111110111101111011111011110110100101010101010111101101011110 e8dfbdefbda5557de8dfbdefbda5557b5e
UTF-8 萬緒愁U}萬緒愁U{^ 1110100010010000101011001110011110110111100100101110011010000100100000010101010101111101111010001001000010101100111001111011011110010010111001101000010010000001010101010111101101011110 e890ace7b792e68481557de890ace7b792e68481557b5e
UHC 萬?愁U}萬?愁U{^ 110110001011111100111111111000011111111001010101011111011101100010111111001111111110000111111110010101010111101101011110 d8bf3fe1fe557dd8bf3fe1fe557b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)