To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鵝??肄??淫??癰??愉?┼膺??繹 111010100100000000111111001111111110001111100101001111110011111110001000111110100011111100111111111000011001111000111111001111111001011011111001001111111000010010101001111001000101111000111111001111111110001110001000 ea403f3fe3e53f3f88fa3f3fe19e3f3f96f93f84a9e45e3f3fe388
EUC-JP 鵝??肄??淫??癰??愉?┼膺??繹 111100111010000100111111001111111110011011100111001111110011111110110000111111000011111100111111111000011111111000111111001111111100110011111011001111111010100010101011111001111011111100111111001111111110010111101000 f3a13f3fe6e73f3fb0fc3f3fe1fe3f3fccfb3fa8abe7bf3f3fe5e8
UTF-8 鵝숈뮆肄덃끽淫됰껜癰귘뫗愉놂┼膺덈꺄繹 111010011011010110011101111011001000100010001000111010111010111010000110111010001000001010000100111010111000110110000011111010111000000110111101111001101011011110101011111010111001000010110000111010101011101110011100111001111001100110110000111010101011011110011000111010111010101110010111111001101000010010001001111010111000011010000010111000101001010010111100111010001000011010111010111010111000110110001000111010101011101010000100111001111011100110111001 e9b59dec8888ebae86e88284eb8d83eb81bde6b7abeb90b0eabb9ce799b0eab798ebab97e68489eb8682e294bce886baeb8d88eaba84e7b9b9
UHC 鵝숈뮆肄덃끽淫됰껜癰귘뫗愉놂┼膺덈꺄繹 1110010010111101100110011110110010010010100101011110110010111101100010001110011010110011101000111110101111100010100010011110101110110010101101001110100010111001100000101110001010010001101110011110101011110000101100111110111110100110101010111110101111101100100010001110101110110010101001011110011010111010 e4bd99ec9295ecbd88e6b3a3ebe289ebb2b4e8b982e291b9eaf0b3efa6abebec88ebb2a5e6ba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)