To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???d?????????d??????B 001111110011111100111111011001000011111100111111001111110011111100111111001111110011111100111111001111110110010000111111001111110011111100111111001111110011111101000010 3f3f3f643f3f3f3f3f3f3f3f3f643f3f3f3f3f3f42
SJIS-WIN テつ竪dテつ辿テつ樽テつ竪dテつ辿テつ樽B 110000111000001011000010100100100100011101100100110000111000001011000010100100100100100011000011100000101100001010010010010011011100001110000010110000101001001001000111011001001100001110000010110000101001001001001000110000111000001011000010100100100100110101000010 c382c2924764c382c29248c382c2924dc382c2924764c382c29248c382c2924d42
EUC-JP テつ竪dテつ辿テつ樽テつ竪dテつ辿テつ樽B 100011101100001110100100110001001100001110101000011001001000111011000011101001001100010011000011101010011000111011000011101001001100010011000011101011101000111011000011101001001100010011000011101010000110010010001110110000111010010011000100110000111010100110001110110000111010010011000100110000111010111001000010 8ec3a4c4c3a8648ec3a4c4c3a98ec3a4c4c3ae8ec3a4c4c3a8648ec3a4c4c3a98ec3a4c4c3ae42
UTF-8 テつ竪dテつ辿テつ樽テつ竪dテつ辿テつ樽B 111011111011111010000011111000111000000110100100111001111010101110101010011001001110111110111110100000111110001110000001101001001110100010111110101111111110111110111110100000111110001110000001101001001110011010101000101111011110111110111110100000111110001110000001101001001110011110101011101010100110010011101111101111101000001111100011100000011010010011101000101111101011111111101111101111101000001111100011100000011010010011100110101010001011110101000010 efbe83e381a4e7abaa64efbe83e381a4e8bebfefbe83e381a4e6a8bdefbe83e381a4e7abaa64efbe83e381a4e8bebfefbe83e381a4e6a8bd42
UHC ?つ竪d?つ??つ樽?つ竪d?つ??つ樽B 00111111101010101100010011100010101101010110010000111111101010101100010000111111001111111010101011000100111100011101110000111111101010101100010011100010101101010110010000111111101010101100010000111111001111111010101011000100111100011101110001000010 3faac4e2b5643faac43f3faac4f1dc3faac4e2b5643faac43f3faac4f1dc42

SJIS-WIN, EUC-JP: legacy charsets mainly used to encode Japanese text on Windows (SJIS-WIN = CP932) and UNIX (EUC-JP).
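
The conversion shown in the table can be approximated with a short Python sketch. It is not the code behind this page. The function name to_bit_strings and the CODECS mapping are hypothetical, and the sketch assumes that Python's cp932 and cp949 codecs are close enough to SJIS-WIN and UHC, and that errors="replace" matches the page's habit of showing "?" (hex 3f) for characters a charset cannot represent.

```python
# Hypothetical mapping from the charset labels in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for unencodable characters (an assumption
    # about how the page behaves, based on the 3f bytes in the ISO-8859-1 row).
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "B"
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For the ASCII letter "B", every charset listed yields the same single byte (binary 01000010, hex 42), which is why each row of the table ends in 42. Non-ASCII input is where the rows diverge.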