To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??宜??臾??艶l?猷?????魚?? 11100001100111110011111100111111100010110101100000111111001111111110010001101011001111110011111110001001100100001000001010001100001111111001011101010001001111110011111100111111001111110011111110001011100110110011111100111111 e19f3f3f8b583f3fe46b3f3f8990828c3f97513f3f3f3f3f8b9b3f3f
EUC-JP 癲??宜??臾??艶l?猷??洹??魚?? 111000101010000100111111001111111011010110111001001111110011111111100111110011000011111100111111101100011111000010100011111011000011111111001101101100100011111100111111100011111100011110111010001111110011111110110101111110110011111100111111 e2a13f3fb5b93f3fe7cc3f3fb1f0a3ec3fcdb23f3f8fc7ba3f3fb5fb3f3f
UTF-8 癲덈챶宜백춳臾됰짎艶l뮆猷녺솻洹섎쳴魚좏룞 111001111001100110110010111010111000110110001000111011001011000110110110111001011010111010011100111010111011000010110001111011001011011010110011111010001000011110111110111010111001000010110000111011001010011110001110111010001000100110110110111011111011110110001100111010111010111010000110111001111000110010110111111010111000010110111010111011001000011010111011111001101011010010111001111011001000010010001110111011001011001110110100111010011010110110011010111011001010001010001111111010111010001110011110 e799b2eb8d88ecb1b6e5ae9cebb0b1ecb6b3e887beeb90b0eca78ee889b6efbd8cebae86e78cb7eb85baec86bbe6b4b9ec848eecb3b4e9ad9aeca28feba39e
UHC 癲덈챶宜백춳臾됰짎艶l뮆猷녺솻洹섎쳴魚좏룞 111011111010011010001000111010111010101010000011111010111111000110111001111010011010110110001111111010111010110010001001111010111010001110011010111001101111110110100011111011001001001010010101111010111010001110000110111001111001100110110000111010101011011110011000111010111010101110010111111001011110000010100000111011011000111110011001 efa688ebaa83ebf1b9e9ad8febac89eba39ae6fda3ec9295eba386e799b0eab798ebab97e5e0a0ed8f99

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)