To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 竪淡属誰遜即辿息促辰存損誰遜損誰遜尊 100100100100011110010010010101111001000110101110100100100100111010010001101110111001000110100110100100100100100010010001101001111001000110100011100100100100001110010001101101101001000110111001100100100100111010010001101110111001000110111001100100100100111010010001101110111001000110111000 9247925791ae924e91bb91a6924891a791a3924391b691b9924e91bb91b9924e91bb91b8
EUC-JP 竪淡属誰遜即辿息促辰存損誰遜損誰遜尊 110000111010100011000011101110001100001010110000110000111010111111000010101111011100001010101000110000111010100111000010101010011100001010100101110000111010010011000010101110001100001010111011110000111010111111000010101111011100001010111011110000111010111111000010101111011100001010111010 c3a8c3b8c2b0c3afc2bdc2a8c3a9c2a9c2a5c3a4c2b8c2bbc3afc2bdc2bbc3afc2bdc2ba
UTF-8 竪淡属誰遜即辿息促辰存損誰遜損誰遜尊 111001111010101110101010111001101011011110100001111001011011000110011110111010001010101010110000111010011000000110011100111001011000110110110011111010001011111010111111111001101000000110101111111001001011111110000011111010001011111010110000111001011010110110011000111001101001000010001101111010001010101010110000111010011000000110011100111001101001000010001101111010001010101010110000111010011000000110011100111001011011000010001010 e7abaae6b7a1e5b19ee8aab0e9819ce58db3e8bebfe681afe4bf83e8beb0e5ad98e6908de8aab0e9819ce6908de8aab0e9819ce5b08a
UHC 竪淡?誰遜??息促辰存損誰遜損誰遜尊 111000101011010111010011101111110011111111100010110000011110000111100001001111110011111111100011110100111111010110110101111100101110001111110000111011011110000111011111111000101100000111100001111000011110000111011111111000101100000111100001111000011111000011101110 e2b5d3bf3fe2c1e1e13f3fe3d3f5b5f2e3f0ede1dfe2c1e1e1e1dfe2c1e1e1f0ee

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)