Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 竪旦臓脱蔵側誰遜捉誰遜即辿束卒誰遜則 100100100100011110010010010101011001000110011111100100100100010110010001101000001001000110100100100100100100111010010001101110111001000110101000100100100100111010010001101110111001000110100110100100100100100010010001101010011001000110110010100100100100111010010001101110111001000110100101 92479255919f924591a091a4924e91bb91a8924e91bb91a6924891a991b2924e91bb91a5
EUC-JP 竪旦臓脱蔵側誰遜捉誰遜即辿束卒誰遜則 110000111010100011000011101101101100001010100001110000111010011011000010101000101100001010100110110000111010111111000010101111011100001010101010110000111010111111000010101111011100001010101000110000111010100111000010101010111100001010110100110000111010111111000010101111011100001010100111 c3a8c3b6c2a1c3a6c2a2c2a6c3afc2bdc2aac3afc2bdc2a8c3a9c2abc2b4c3afc2bdc2a7
UTF-8 竪旦臓脱蔵側誰遜捉誰遜即辿束卒誰遜則 111001111010101110101010111001101001011110100110111010001000011110010011111010001000010010110001111010001001010010110101111001011000000110110100111010001010101010110000111010011000000110011100111001101000110110001001111010001010101010110000111010011000000110011100111001011000110110110011111010001011111010111111111001101001110110011111111001011000110110010010111010001010101010110000111010011000000110011100111001011000100110000111 e7abaae697a6e88793e884b1e894b5e581b4e8aab0e9819ce68d89e8aab0e9819ce58db3e8bebfe69d9fe58d92e8aab0e9819ce58987
UHC 竪旦???側誰遜捉誰遜??束卒誰遜則 11100010101101011101001110101001001111110011111100111111111101101011000011100010110000011110000111100001111100111011010111100010110000011110000111100001001111110011111111100001110101101111000011101111111000101100000111100001111000011111011011001110 e2b5d3a93f3f3ff6b0e2c1e1e1f3b5e2c1e1e13f3fe1d6f0efe2c1e1e1f6ce

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
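
The conversion in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation: it assumes Python's built-in codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` are close enough stand-ins for ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC, and that unencodable characters are replaced with `?` (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS` and `encode_row` are hypothetical, and edge-case mappings may differ from the tool's.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Characters the charset cannot represent become "?" (byte 0x3f).
    data = text.encode(codec, errors="replace")
    # Decode again to show what survives the round trip.
    shown = data.decode(codec, errors="replace")
    # Eight bits per byte, concatenated without separators.
    bits = "".join(f"{byte:08b}" for byte in data)
    return shown, bits, data.hex()


if __name__ == "__main__":
    sample = "竪"  # first character of the input used in the table
    for label, codec in CHARSETS.items():
        shown, bits, hexstr = encode_row(sample, codec)
        print(label, shown, bits, hexstr)
```

Each printed line has the same four columns as the table: charset, character, binary bit string, and hexadecimal bit string.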