To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 遵??畔?運蓑?良遵??畔?運蓑?粱^ 1000111110000101001111110011111110010100110010000011111110001001010111101001011010101010001111111001011111000111100011111000010100111111001111111001010011001000001111111000100101011110100101101010101000111111111000101110101101011110 8f853f3f94c83f895e96aa3f97c78f853f3f94c83f895e96aa3fe2eb5e
EUC-JP 遵??畔?運蓑?良遵??畔?運蓑?粱^ 1011110111100101001111110011111111001000110010100011111110110001101111111100110010101100001111111100111011001001101111011110010100111111001111111100100011001010001111111011000110111111110011001010110000111111111001001110110101011110 bde53f3fc8ca3fb1bfccac3fcec9bde53f3fc8ca3fb1bfccac3fe4ed5e
UTF-8 遵뀄렭畔섰運蓑렱良遵뀄렭畔섰運蓑렱粱^ 11101001100000011011010111101011100000001000010011101011101000001010110111100111100101011001010011101100100001001011000011101001100000011000101111101000100100111001000111101011101000001011000111101000100010011010111111101001100000011011010111101011100000001000010011101011101000001010110111100111100101011001010011101100100001001011000011101001100000011000101111101000100100111001000111101011101000001011000111100111101100101011000101011110 e981b5eb8084eba0ade79594ec84b0e9818be89391eba0b1e889afe981b5eb8084eba0ade79594ec84b0e9818be89391eba0b1e7b2b15e
UHC 遵뀄렭畔섰運蓑렱良遵뀄렭畔섰運蓑렱粱^ 11110001111001011011001011101101100011101011101011011010111011011011110010111001111010101010000111011110111011101000111010111110110101011101111011110001111001011011001011101101100011101011101011011010111011011011110010111001111010101010000111011110111011101000111010111110110101011101110001011110 f1e5b2ed8ebadaedbcb9eaa1deee8ebed5def1e5b2ed8ebadaedbcb9eaa1deee8ebed5dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)