To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??意??醫??庸??揄??猷〓?筌??? 11100001100111110011111100111111100010001101001100111111001111111110011111001110001111111000000101001000100101110110011000111111001111111001110110001001001111110011111110010111010100011000000110101100001111111110001010100011001111110011111100111111 e19f3f3f88d33f3fe7ce3f814897663f3f9d893f3f975181ac3fe2a33f3f3f
EUC-JP 癲??意??醫??庸??揄??猷〓?筌?Ł? 111000101010000100111111001111111011000011010101001111110011111111101110110100000011111110100001101010011100110111000111001111110011111111011001111010010011111100111111110011011011001010100010101011100011111111100100101001010011111110001111101010011010100000111111 e2a13f3fb0d53f3feed03fa1a9cdc73f3fd9e93f3fcdb2a2ae3fe4a53f8fa9a83f
UTF-8 癲ㅺ퀎意ㅶ벉醫딆?庸뉗렲揄됵쭔猷〓쑕筌뗫Ł理 1110011110011001101100101110001110000101101110101110110110000000100011101110011010000100100011111110001110000101101101101110101110110010100010011110100110000110101010111110101110010100100001101110111110111100100111111110010110111010101110001110101110001001100101111110101110100000101100101110011010001111100001001110101110010000101101011110110010101101100101001110011110001100101101111110001110000000100100111110110010010001100101011110011110101101100011001110101110010111101010111100010110000001111011111010011110100100 e799b2e385baed808ee6848fe385b6ebb289e986abeb9486efbc9fe5bab8eb8997eba0b2e68f84eb90b5ecad94e78cb7e38093ec9195e7ad8ceb97abc581efa7a4
UHC 癲ㅺ퀎意ㅶ벉醫딆?庸뉗렲揄됵쭔猷〓쑕筌뗫Ł理 1110111110100110101001001110101010110011100001001110101111110010101001001110011010010011101011001110110010100010100010101110110010100011101111111110100110111100100001111110110010001110101111111110101011110001100010011110111110100111100011001110101110100011101000011110101110011100101101001110111110100111100010111110101110101000101010011110110010110101 efa6a4eab384ebf2a4e693aceca28aeca3bfe9bc87ec8ebfeaf189efa78ceba3a1eb9cb4efa78beba8a9ecb5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)