To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 畏??謠??絶??腸↓?絶?ぜ節??嵬??^ 10001000110110000011111100111111111001101000111100111111001111111001000011100010001111110011111110010010101100001000000110101011001111111001000011100010001111111000001010111010100100001101111100111111001111111001101111001010001111110011111101011110 88d83f3fe68f3f3f90e23f3f92b081ab3f90e23f82ba90df3f3f9bca3f3f5e
EUC-JP 畏??謠??絶??腸↓?絶?ぜ節??嵬??^ 10110000110110100011111100111111111010111110111100111111001111111100000011100100001111110011111111000100101100101010001010101101001111111100000011100100001111111010010010111100110000001110000100111111001111111101011011001100001111110011111101011110 b0da3f3febef3f3fc0e43f3fc4b2a2ad3fc0e43fa4bcc0e13f3fd6cc3f3f5e
UTF-8 畏쇽풕謠쇽쉽絶덆뀼腸↓댘絶섌ぜ節깍푽嵬뀌븶^ 11100111100101011000111111101100100001111011110111101101100100101001010111101000101011001010000011101100100001111011110111101100100010011011110111100111101101011011011011101011100011011000011011101011100000001011110011101000100001011011100011100010100001101001001111101011100011001001100011100111101101011011011011101100100001001000110011100011100000011001110011100111101011111000000011101010101110011000110111101101100100011011110111100101101101011010110011101011100000001000110011101011101110001011011001011110 e7958fec87bded9295e8aca0ec87bdec89bde7b5b6eb8d86eb80bce885b8e28693eb8c98e7b5b6ec848ce3819ce7af80eab98ded91bde5b5aceb808cebb8b65e
UHC 畏쇽풕謠쇽쉽絶덆뀼腸↓댘絶섌ぜ節깍푽嵬뀌븶^ 11101000111001101011110011101111101111101001100011101001101010101011110011101111101111011011000111101111101111101000100011101001100001011011001011101101111100111010000111101001100010001011110011101111101111101001100011101001101010101011110011101111101111011011000111101111101111101000100011101000111000111011001011101110100101011001111101011110 e8e6bcefbe98e9aabcefbdb1efbe88e985b2edf3a1e988bcefbe98e9aabcefbdb1efbe88e8e3b2ee959f5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)