What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???L[???L[^ 0011111100111111001111110100110001011011001111110011111100111111010011000101101101011110 3f3f3f4c5b3f3f3f4c5b5e
SJIS-WIN 狸旦続L[狸旦続L[^ 1001001001001011100100100101010110010001101100010100110001011011100100100100101110010010010101011001000110110001010011000101101101011110 924b925591b14c5b924b925591b14c5b5e
EUC-JP 狸旦続L[狸旦続L[^ 1100001110101100110000111011011011000010101100110100110001011011110000111010110011000011101101101100001010110011010011000101101101011110 c3acc3b6c2b34c5bc3acc3b6c2b34c5b5e
UTF-8 狸旦続L[狸旦続L[^ 1110011110001011101110001110011010010111101001101110011110110110100110100100110001011011111001111000101110111000111001101001011110100110111001111011011010011010010011000101101101011110 e78bb8e697a6e7b69a4c5be78bb8e697a6e7b69a4c5b5e
UHC 狸旦?L[狸旦?L[^ 110101111110000111010011101010010011111101001100010110111101011111100001110100111010100100111111010011000101101101011110 d7e1d3a93f4c5bd7e1d3a93f4c5b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
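
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The mapping from the table's charset names to Python codec names (for example `cp932` for SJIS-WIN and `cp949` for UHC) is an assumption, as is the helper name `encode_bits`. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to be what the ISO-8859-1 and UHC rows show.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


# Print one row per charset for a sample input.
for name, codec in CODECS.items():
    binary, hexadecimal = encode_bits("狸旦続L[", codec)
    print(name, binary, hexadecimal)
```

Calling `encode_bits` with other text and any codec name that Python supports produces the same two columns for that input.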