To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 竪旦臓誰遜袖竪担他脱促蔵辿続息辿即他 100100100100011110010010010101011001000110011111100100100100111010010001101110111001000110110011100100100100011110010010010100111001000110111100100100100100010110010001101000111001000110100000100100100100100010010001101100011001000110100111100100100100100010010001101001101001000110111100 92479255919f924e91bb91b39247925391bc924591a391a0924891b191a7924891a691bc
EUC-JP 竪旦臓誰遜袖竪担他脱促蔵辿続息辿即他 110000111010100011000011101101101100001010100001110000111010111111000010101111011100001010110101110000111010100011000011101101001100001010111110110000111010011011000010101001011100001010100010110000111010100111000010101100111100001010101001110000111010100111000010101010001100001010111110 c3a8c3b6c2a1c3afc2bdc2b5c3a8c3b4c2bec3a6c2a5c2a2c3a9c2b3c2a9c3a9c2a8c2be
UTF-8 竪旦臓誰遜袖竪担他脱促蔵辿続息辿即他 111001111010101110101010111001101001011110100110111010001000011110010011111010001010101010110000111010011000000110011100111010001010001010010110111001111010101110101010111001101000101110000101111001001011101110010110111010001000010010110001111001001011111110000011111010001001010010110101111010001011111010111111111001111011011010011010111001101000000110101111111010001011111010111111111001011000110110110011111001001011101110010110 e7abaae697a6e88793e8aab0e9819ce8a296e7abaae68b85e4bb96e884b1e4bf83e894b5e8bebfe7b69ae681afe8bebfe58db3e4bb96
UHC 竪旦?誰遜袖竪?他?促???息??他 11100010101101011101001110101001001111111110001011000001111000011110000111100010110000001110001010110101001111111111011011100010001111111111010110110101001111110011111100111111111000111101001100111111001111111111011011100010 e2b5d3a93fe2c1e1e1e2c0e2b53ff6e23ff5b53f3f3fe3d33f3ff6e2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)