To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????R????^[????R????^[^ 0011111100111111001111110011111101010010001111110011111100111111001111110101111001011011001111110011111100111111001111110101001000111111001111110011111100111111010111100101101101011110 3f3f3f3f523f3f3f3f5e5b3f3f3f3f523f3f3f3f5e5b5e
SJIS-WIN 陝カ蜀「R陝カ蜀「^[陝カ蜀「R陝カ蜀「^[^ 11101000100111111011011011100101100001101010001001010010111010001001111110110110111001011000011010100010010111100101101111101000100111111011011011100101100001101010001001010010111010001001111110110110111001011000011010100010010111100101101101011110 e89fb6e586a252e89fb6e586a25e5be89fb6e586a252e89fb6e586a25e5b5e
EUC-JP 陝カ蜀「R陝カ蜀「^[陝カ蜀「R陝カ蜀「^[^ 111100001010000110001110101101101110100111100110100011101010001001010010111100001010000110001110101101101110100111100110100011101010001001011110010110111111000010100001100011101011011011101001111001101000111010100010010100101111000010100001100011101011011011101001111001101000111010100010010111100101101101011110 f0a18eb6e9e68ea252f0a18eb6e9e68ea25e5bf0a18eb6e9e68ea252f0a18eb6e9e68ea25e5b5e
UTF-8 陝カ蜀「R陝カ蜀「^[陝カ蜀「R陝カ蜀「^[^ 11101001100110011001110111101111101111011011011011101000100111001000000011101111101111011010001001010010111010011001100110011101111011111011110110110110111010001001110010000000111011111011110110100010010111100101101111101001100110011001110111101111101111011011011011101000100111001000000011101111101111011010001001010010111010011001100110011101111011111011110110110110111010001001110010000000111011111011110110100010010111100101101101011110 e9999defbdb6e89c80efbda252e9999defbdb6e89c80efbda25e5be9999defbdb6e89c80efbda252e9999defbdb6e89c80efbda25e5b5e
UHC 陝?蜀?R陝?蜀?^[陝?蜀?R陝?蜀?^[^ 11100000111011010011111111110101101110010011111101010010111000001110110100111111111101011011100100111111010111100101101111100000111011010011111111110101101110010011111101010010111000001110110100111111111101011011100100111111010111100101101101011110 e0ed3ff5b93f52e0ed3ff5b93f5e5be0ed3ff5b93f52e0ed3ff5b93f5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)