What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8宜??音??馭??日??矣???ν? 1110000110011111001111111000001001010111100010110101100000111111001111111000100110111001001111110011111111101001011001100011111100111111100100111111101000111111001111111110000111100001001111110011111100111111100000111100101100111111 e19f3f82578b583f3f89b93f3fe9663f3f93fa3f3fe1e13f3f3f83cb3f
EUC-JP 癲?8宜??音??馭??日??矣???ν? 1110001010100001001111111010001110111000101101011011100100111111001111111011001010111011001111110011111111110001110001110011111100111111110001101111110000111111001111111110001011100011001111110011111100111111101001101100110100111111 e2a13fa3b8b5b93f3fb2bb3f3ff1c73f3fc6fc3f3fe2e33f3f3fa6cd3f
UTF-8 癲쒕8宜룩눧音쀬뵯馭귙꺈日뗧춯矣뚯탟若ν룤 1110011110011001101100101110110010010010100101011110111110111100100110001110010110101110100111001110101110100011101010011110101110001000101001111110100110011111101100111110110010000000101011001110101110110101101011111110100110100110101011011110101010110111100110011110101010111010100010001110011010010111101001011110101110010111101001111110110010110110101011111110011110011111101000111110101110011010101011111110110110000011100111111110111110100101101101001100111010111101111010111010001110100100 e799b2ec9295efbc98e5ae9ceba3a9eb88a7e99fb3ec80acebb5afe9a6adeab799eaba88e697a5eb97a7ecb6afe79fa3eb9aafed839fefa5b4cebdeba3a4
UHC 癲쒕8宜룩눧音쀬뵯馭귙꺈日뗧춯矣뚯탟若ν룤 111011111010011010011100111010111010001110111000111010111111000110110111111010001000011110111110111010111110010110010111111011001001010010101101111001011101111110000010111000111000001110101111111011001110110110001011111001111010110110001100111010111111100010001100111011001011010110000011111001011010111010100101111011011000111110011101 efa69ceba3b8ebf1b7e887beebe597ec94ade5df82e383afeced8be7ad8cebf88cecb583e5aea5ed8f9d

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
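
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own code: the helper name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the mapping of each label to a Python codec (for example UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Illustrative sketch only; names and codec choices are assumptions.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # Windows code page 932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed Python codec for Unified Hangul Code
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hex) strings."""
    # Unencodable characters become b"?" (0x3f) instead of raising an error.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings("日", codec)
    print(label, binary, hexadecimal)
```

For the single character 日, this yields `e697a5` in UTF-8, `93fa` in SJIS-WIN, and `c6fc` in EUC-JP, the same byte sequences that appear for 日 in the rows above.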