What bit string is a character (or several characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??爰???щ?歪??猷η?鷹??塋 111000011001111100111111001111111110000010100111001111110011111100111111100001001000101100111111100110000110001100111111001111111001011101010001100000111100010100111111100100011110100100111111001111111001101011001000 e19f3f3fe0a73f3f3f848b3f98633f3f975183c53f91e93f3f9ac8
EUC-JP 癲??爰??蓀щ?歪??猷η?鷹??塋 1110001010100001001111110011111111100000101010010011111100111111100011111101100011111000101001111110101100111111110011111100010000111111001111111100110110110010101001101100011100111111110000101110101100111111001111111101010011001010 e2a13f3fe0a93f3f8fd8f8a7eb3fcfc43f3fcdb2a6c73fc2eb3f3fd4ca
UTF-8 癲ㅺ퓭爰귝끽蓀щ쨨歪묆굥猷η뛾鷹꾪돫塋 11100111100110011011001011100011100001011011101011101101100100111010110111100111100010001011000011101010101101111001110111101011100000011011110111101000100100111000000011010001100010011110110010101000101010001110011010101101101010101110101110101100100001101110101010110101101001011110011110001100101101111100111010110111111010111001101110111110111010011011011110111001111010101011111010101010111010111000111110101011111001011010000110001011 e799b2e385baed93ade788b0eab79deb81bde89380d189eca8a8e6adaaebac86eab5a5e78cb7ceb7eb9bbee9b7b9eabeaaeb8fabe5a18b
UHC 癲ㅺ퓭爰귝끽蓀щ쨨歪묆굥猷η뛾鷹꾪돫塋 1110111110100110101001001110101010111111100101001110101010111010100000101110011010110011101000111110000111100000101011001110101110100100100000111110100011100000100100011110001110000010100010111110101110100011101001011110011110001101100001001110101111101101100001001110110110001001101011101110011110101011 efa6a4eabf94eaba82e6b3a3e1e0aceba483e8e091e3828beba3a5e78d84ebed84ed89aee7ab

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
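
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the page's own implementation. The helper name `encode_bits` and the mapping of the table's labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table above shows, but the page's exact substitution rules may differ.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
# These names are assumptions; the page may use a different conversion library.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters are replaced with '?' (0x3f).
    """
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, most significant bit first.
    bits = "".join(format(byte, "08b") for byte in data)
    return bits, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hexa = encode_bits("\u3042", codec)  # U+3042 HIRAGANA LETTER A
        print(label, bits, hexa)
```

The binary column is simply the hexadecimal column with every byte written as eight bits, so the two columns always describe the same byte sequence.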