To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲???鴉??泣??循??蘖??魏?㎞溢 11100001100111110011111100111111001111111110100111101011001111110011111110001011100000110011111100111111100011110111101000111111001111111001111101010000001111110011111111101001101100000011111110000111011100011000100011101100 e19f3f3f3fe9eb3f3f8b833f3f8f7a3f3f9f503f3fe9b03f877188ec
EUC-JP 癲???鴉??泣??循??蘖??魏??溢 111000101010000100111111001111110011111111110010111011010011111100111111101101011110001100111111001111111011110111011011001111110011111111011101101100010011111100111111111100101011001000111111001111111011000011101110 e2a13f3f3ff2ed3f3fb5e33f3fbddb3f3fddb13f3ff2b23f3fb0ee
UTF-8 癲ⓥ뫖흭鴉딅뛼泣앶굫循딅럡蘖뽰궛魏숋㎞溢 111001111001100110110010111000101001001110100101111010111010101110010110111011011001110110101101111010011011010010001001111010111001010010000101111010111001101110111100111001101011001110100011111011001001010110110110111010101011010110101011111001011011111010101010111010111001010010000101111010111001111110100001111010001001100010010110111010111011110110110000111010101011011010011011111010011010110110001111111011001000100010001011111000111000111010011110111001101011101010100010 e799b2e293a5ebab96ed9dade9b489eb9485eb9bbce6b3a3ec95b6eab5abe5beaaeb9485eb9fa1e89896ebbdb0eab69be9ad8fec888be38e9ee6baa2
UHC 癲ⓥ뫖흭鴉딅뛼泣앶굫循딅럡蘖뽰궛魏숋㎞溢 11101111101001101010100011100010100100011011100011000101100010011110010010111100100010101110101110001101100000101110101111101000100111011110100110000010100100011110001011100000100010101110101110001110100001001110010111101110100101101110110010000010101100001110101011100000100110011110111110100111101100001110110011101110 efa6a8e291b8c589e4bc8aeb8d82ebe89de98291e2e08aeb8e84e5ee96ec82b0eae099efa7b0ecee

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)