To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??±???????????±?????????B 00111111001111111011000100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111011000100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3fb13f3f3f3f3f3f3f3f3f3f3fb13f3f3f3f3f3f3f3f3f42
SJIS-WIN 癲?±違х?純??筌??癲?±違х?純??筌??B 11100001100111110011111110000001011111011000100011100001100001001000011100111111100011111000001100111111001111111110001010100011001111110011111111100001100111110011111110000001011111011000100011100001100001001000011100111111100011111000001100111111001111111110001010100011001111110011111101000010 e19f3f817d88e184873f8f833f3fe2a33f3fe19f3f817d88e184873f8f833f3fe2a33f3f42
EUC-JP 癲?±違х?純??筌??癲?±違х?純??筌??B 11100010101000010011111110100001110111101011000011100011101001111110011100111111101111011110001100111111001111111110010010100101001111110011111111100010101000010011111110100001110111101011000011100011101001111110011100111111101111011110001100111111001111111110010010100101001111110011111101000010 e2a13fa1deb0e3a7e73fbde33f3fe4a53f3fe2a13fa1deb0e3a7e73fbde33f3fe4a53f3f42
UTF-8 癲됱±違х쨼純앹뜗筌뗫첁癲됱±違х쨼純앹뜗筌뗫첁B 111001111001100110110010111010111001000010110001110000101011000111101001100000011001010111010001100001011110110010101000101111001110011110110100100101001110110010010101101110011110101110011100100101111110011110101101100011001110101110010111101010111110110010110010100000011110011110011001101100101110101110010000101100011100001010110001111010011000000110010101110100011000010111101100101010001011110011100111101101001001010011101100100101011011100111101011100111001001011111100111101011011000110011101011100101111010101111101100101100101000000101000010 e799b2eb90b1c2b1e98195d185eca8bce7b494ec95b9eb9c97e7ad8ceb97abecb281e799b2eb90b1c2b1e98195d185eca8bce7b494ec95b9eb9c97e7ad8ceb97abecb28142
UHC 癲됱±違х쨼純앹뜗筌뗫첁癲됱±違х쨼純앹뜗筌뗫첁B 11101111101001101000100111101100101000011011111011101010110111101010110011100111101001001001011011100010111011011001110111101100100011011001101011101111101001111000101111101011101010101000111011101111101001101000100111101100101000011011111011101010110111101010110011100111101001001001011011100010111011011001110111101100100011011001101011101111101001111000101111101011101010101000111001000010 efa689eca1beeadeace7a496e2ed9dec8d9aefa78bebaa8eefa689eca1beeadeace7a496e2ed9dec8d9aefa78bebaa8e42

SJIS-Win, EUC-JP: Classic charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
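
A conversion like the one in the table can be sketched in Python as follows. This is a minimal illustration, not the code behind this page. The codec names are assumptions about how the charset labels map onto Python's standard codecs (SJIS-Win to cp932, UHC to cp949), and the function name encode_table is hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is the behaviour visible in the ISO-8859-1 row above.

```python
# Assumed mapping from the charset labels used on this page
# to Python standard-library codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to code page 949
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("±B"):
        print(label, bits, hex_string)
```

For the input "±B", the ISO-8859-1 row comes out as b142 and the UTF-8 row as c2b142, which agrees with the b1, c2b1 and 42 byte sequences in the table above.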