What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 竪旦臓脱尊揃誰遜造誰遜測竪担即誰遜卒 100100100100011110010010010101011001000110011111100100100100010110010001101110001001000110110101100100100100111010010001101110111001000110100010100100100100111010010001101110111001000110101010100100100100011110010010010100111001000110100110100100100100111010010001101110111001000110110010 92479255919f924591b891b5924e91bb91a2924e91bb91aa9247925391a6924e91bb91b2
EUC-JP 竪旦臓脱尊揃誰遜造誰遜測竪担即誰遜卒 110000111010100011000011101101101100001010100001110000111010011011000010101110101100001010110111110000111010111111000010101111011100001010100100110000111010111111000010101111011100001010101100110000111010100011000011101101001100001010101000110000111010111111000010101111011100001010110100 c3a8c3b6c2a1c3a6c2bac2b7c3afc2bdc2a4c3afc2bdc2acc3a8c3b4c2a8c3afc2bdc2b4
UTF-8 竪旦臓脱尊揃誰遜造誰遜測竪担即誰遜卒 111001111010101110101010111001101001011110100110111010001000011110010011111010001000010010110001111001011011000010001010111001101000111110000011111010001010101010110000111010011000000110011100111010011000000010100000111010001010101010110000111010011000000110011100111001101011100010101100111001111010101110101010111001101000101110000101111001011000110110110011111010001010101010110000111010011000000110011100111001011000110110010010 e7abaae697a6e88793e884b1e5b08ae68f83e8aab0e9819ce980a0e8aab0e9819ce6b8ace7abaae68b85e58db3e8aab0e9819ce58d92
UHC 竪旦??尊?誰遜造誰遜測竪??誰遜卒 11100010101101011101001110101001001111110011111111110000111011100011111111100010110000011110000111100001111100001110001111100010110000011110000111100001111101101011010011100010101101010011111100111111111000101100000111100001111000011111000011101111 e2b5d3a93f3ff0ee3fe2c1e1e1f0e3e2c1e1e1f6b4e2b53f3fe2c1e1e1f0ef

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP) respectively.
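
The conversion shown in the table can be roughly reproduced offline. The following is a minimal Python sketch, not the tool's actual implementation. The helper name `encode_rows` and the `CHARSETS` mapping are hypothetical, and the codec names are Python's own, assumed here to correspond to the charsets in the table (`cp932` for SJIS-WIN, `cp949` for UHC). Characters that a charset cannot represent are replaced with `?` (0x3f), which matches what the ISO-8859-1 and UHC rows show.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return one (charset, shown_text, binary, hex) tuple per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        # Decode again to see which characters survived the round trip.
        shown = data.decode(codec)
        # Render every byte as eight binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexed in encode_rows("竪"):
        print(label, shown, bits, hexed)
```

For the single character 竪, the UTF-8 row should read `e7abaa`, the same three bytes that open the UTF-8 row of the table above.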