What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????R?????????x 0011111100111111001111110011111100111111001111110011111100111111001111110101001000111111001111110011111100111111001111110011111100111111001111110011111101111000 3f3f3f3f3f3f3f3f3f523f3f3f3f3f3f3f3f3f78
SJIS-WIN 誰遜他誰遜足竪湛束R誰遜他誰遜足竪湛束x 1001001001001110100100011011101110010001101111001001001001001110100100011011101110010001101010111001001001000111100100100101100010010001101010010101001010010010010011101001000110111011100100011011110010010010010011101001000110111011100100011010101110010010010001111001001001011000100100011010100101111000 924e91bb91bc924e91bb91ab9247925891a952924e91bb91bc924e91bb91ab9247925891a978
EUC-JP 誰遜他誰遜足竪湛束R誰遜他誰遜足竪湛束x 1100001110101111110000101011110111000010101111101100001110101111110000101011110111000010101011011100001110101000110000111011100111000010101010110101001011000011101011111100001010111101110000101011111011000011101011111100001010111101110000101010110111000011101010001100001110111001110000101010101101111000 c3afc2bdc2bec3afc2bdc2adc3a8c3b9c2ab52c3afc2bdc2bec3afc2bdc2adc3a8c3b9c2ab78
UTF-8 誰遜他誰遜足竪湛束R誰遜他誰遜足竪湛束x 1110100010101010101100001110100110000001100111001110010010111011100101101110100010101010101100001110100110000001100111001110100010110110101100111110011110101011101010101110011010111001100110111110011010011101100111110101001011101000101010101011000011101001100000011001110011100100101110111001011011101000101010101011000011101001100000011001110011101000101101101011001111100111101010111010101011100110101110011001101111100110100111011001111101111000 e8aab0e9819ce4bb96e8aab0e9819ce8b6b3e7abaae6b99be69d9f52e8aab0e9819ce4bb96e8aab0e9819ce8b6b3e7abaae6b99be69d9f78
UHC 誰遜他誰遜足竪湛束R誰遜他誰遜足竪湛束x 1110001011000001111000011110000111110110111000101110001011000001111000011110000111110000111010111110001010110101110100111100000011100001110101100101001011100010110000011110000111100001111101101110001011100010110000011110000111100001111100001110101111100010101101011101001111000000111000011101011001111000 e2c1e1e1f6e2e2c1e1e1f0ebe2b5d3c0e1d652e2c1e1e1f6e2e2c1e1e1f0ebe2b5d3c0e1d678

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
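
The conversion shown in the table can be reproduced offline. The following is a minimal Python sketch, not the code behind this page. The function name encode_table and the codec names are assumptions: Python's "cp932" and "cp949" are used here as stand-ins for SJIS-WIN and UHC, and they may differ from this page's converter for a few characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which is the behaviour the ISO-8859-1 row above suggests.

```python
def encode_table(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Return (charset, binary string, hex string) for text in each charset.

    Hypothetical helper: the codec names are Python's, chosen as assumed
    equivalents of ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8 and UHC.
    """
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" (0x3f) for unmappable characters
        data = text.encode(name, errors="replace")
        # Each byte becomes 8 binary digits, most significant bit first
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((name, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexstr in encode_table("\u8ab0R"):  # U+8AB0 followed by "R"
        print(name, bits, hexstr)
```

For the two-character input U+8AB0 followed by "R", the UTF-8 row should read e8aab052 in hexadecimal, which agrees with the leading bytes of the UTF-8 row in the table.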