To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????BB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100001001000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4242
SJIS-WIN 竪多俗誰遜即竪多捉誰遜息竪他揃誰遜村BB 1001001001000111100100011011110110010001101011011001001001001110100100011011101110010001101001101001001001000111100100011011110110010001101010001001001001001110100100011011101110010001101001111001001001000111100100011011110010010001101101011001001001001110100100011011101110010001101110100100001001000010 924791bd91ad924e91bb91a6924791bd91a8924e91bb91a7924791bc91b5924e91bb91ba4242
EUC-JP 竪多俗誰遜即竪多捉誰遜息竪他揃誰遜村BB 1100001110101000110000101011111111000010101011111100001110101111110000101011110111000010101010001100001110101000110000101011111111000010101010101100001110101111110000101011110111000010101010011100001110101000110000101011111011000010101101111100001110101111110000101011110111000010101111000100001001000010 c3a8c2bfc2afc3afc2bdc2a8c3a8c2bfc2aac3afc2bdc2a9c3a8c2bec2b7c3afc2bdc2bc4242
UTF-8 竪多俗誰遜即竪多捉誰遜息竪他揃誰遜村BB 1110011110101011101010101110010110100100100110101110010010111111100101111110100010101010101100001110100110000001100111001110010110001101101100111110011110101011101010101110010110100100100110101110011010001101100010011110100010101010101100001110100110000001100111001110011010000001101011111110011110101011101010101110010010111011100101101110011010001111100000111110100010101010101100001110100110000001100111001110011010011101100100010100001001000010 e7abaae5a49ae4bf97e8aab0e9819ce58db3e7abaae5a49ae68d89e8aab0e9819ce681afe7abaae4bb96e68f83e8aab0e9819ce69d914242
UHC 竪多俗誰遜?竪多捉誰遜息竪他?誰遜村BB 111000101011010111010010111111011110000111010100111000101100000111100001111000010011111111100010101101011101001011111101111100111011010111100010110000011110000111100001111000111101001111100010101101011111011011100010001111111110001011000001111000011110000111110101101111010100001001000010 e2b5d2fde1d4e2c1e1e13fe2b5d2fdf3b5e2c1e1e1e3d3e2b5f6e23fe2c1e1e1f5bd4242

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)