What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??意??蟻??奄??萸??懿??鸚?? 11100001100111110011111100111111100010001101001100111111001111111000101101100001001111110011111110001001100000100011111100111111111001001100111000111111001111111001110011110010001111110011111111101010010111110011111100111111 e19f3f3f88d33f3f8b613f3f89823f3fe4ce3f3f9cf23f3fea5f3f3f
EUC-JP 癲??意??蟻??奄??萸??懿??鸚?? 11100010101000010011111100111111101100001101010100111111001111111011010111000010001111110011111110110001111000100011111100111111111010001101000000111111001111111101100011110100001111110011111111110011110000000011111100111111 e2a13f3fb0d53f3fb5c23f3fb1e23f3fe8d03f3fd8f43f3ff3c03f3f
UTF-8 癲뉖낄意좈씣蟻륂뒍奄몃굟萸띌펶懿몄돪鸚룹틳 111001111001100110110010111010111000100110010110111010111000001010000100111001101000010010001111111011001010001010001000111011001001010010100011111010001001111110111011111010111010010110000010111010111001001010001101111001011010010110000100111010111010101010000011111010101011010110011111111010001001000010111000111010111001110110001100111011011000111010110110111001101000011110111111111010111010101010000100111010111000111110101010111010011011100010011010111010111010001110111001111011011000101110110011 e799b2eb8996eb8284e6848feca288ec94a3e89fbbeba582eb928de5a584ebaa83eab59fe890b8eb9d8ced8eb6e687bfebaa84eb8faae9b89aeba3b9ed8bb3
UHC 癲뉖낄意좈씣蟻륂뒍奄몃굟萸띌펶懿몄돪鸚룹틳 111011111010011010000111111010111011001110100101111010111111001010100000111010011001110110110111111010111111110010001111111011011000101010001010111001011111001010111000111010111000001010000111111010111010110110110110111010011011110010000111111010111111001110111000111011001000100110101101111001011010010010110111111011001011101010011011 efa687ebb3a5ebf2a0e99db7ebfc8fed8a8ae5f2b8eb8287ebadb6e9bc87ebf3b8ec89ade5a4b7ecba9b

SJIS-Win, EUC-JP: Legacy character sets used mainly for Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
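
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the page's actual implementation. The helper name encode_table and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to match what the table does in its ISO-8859-1 row.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, round-tripped text, binary string, hex string) rows.

    Hypothetical helper: encodes `text` in each charset, substituting "?"
    for any character the charset cannot represent.
    """
    rows = []
    for label, codec in CHARSETS.items():
        data = text.encode(codec, errors="replace")
        # Eight binary digits per byte, concatenated without separators.
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, data.decode(codec), binary, data.hex()))
    return rows


if __name__ == "__main__":
    for row in encode_table("あ"):
        print(*row)
```

For the input "あ", the UTF-8 row of this sketch has the hexadecimal string e38182, while the ISO-8859-1 row falls back to 3f because that charset has no Japanese characters.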