To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????AB 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110100000101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f4142
SJIS-WIN 偲痔偲鹿篠ホナ爾篠フト宍偲辞篠湿篠示AB 10001110110000111000111010100100100011101100001110001110101011011000111011000010110011101100010110001110101000101000111011000010110011001100010010001110101100111000111011000011100011101010101110001110110000101000111010111100100011101100001010001110101001100100000101000010 8ec38ea48ec38ead8ec2cec58ea28ec2ccc48eb38ec38eab8ec28ebc8ec28ea64142
EUC-JP 偲痔偲鹿篠ホナ爾篠フト宍偲辞篠湿篠示AB 1011110011000101101111001010011010111100110001011011110010101111101111001100010010001110110011101000111011000101101111001010010010111100110001001000111011001100100011101100010010111100101101011011110011000101101111001010110110111100110001001011110010111110101111001100010010111100101010000100000101000010 bcc5bca6bcc5bcafbcc48ece8ec5bca4bcc48ecc8ec4bcb5bcc5bcadbcc4bcbebcc4bca84142
UTF-8 偲痔偲鹿篠ホナ爾篠フト宍偲辞篠湿篠示AB 1110010110000001101100101110011110010111100101001110010110000001101100101110100110111001101111111110011110101111101000001110111110111110100011101110111110111110100001011110011110001000101111101110011110101111101000001110111110111110100011001110111110111110100001001110010110101110100011011110010110000001101100101110100010111110100111101110011110101111101000001110011010111001101111111110011110101111101000001110011110100100101110100100000101000010 e581b2e79794e581b2e9b9bfe7afa0efbe8eefbe85e788bee7afa0efbe8cefbe84e5ae8de581b2e8be9ee7afa0e6b9bfe7afa0e7a4ba4142
UHC ?痔?鹿篠??爾篠?????篠?篠示AB 00111111111101101100000000111111110101101110001111100001110001100011111100111111111011001011001111100001110001100011111100111111001111110011111100111111111000011100011000111111111000011100011011100011110001100100000101000010 3ff6c03fd6e3e1c63f3fecb3e1c63f3f3f3f3fe1c63fe1c6e3c64142

SJIS-Win, EUC-JP: Legacy character sets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
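
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumed equivalents of the charset labels above, and the helper names `to_bit_strings` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the ISO-8859-1 row. Results for some characters may differ slightly from the table, because codec implementations do not agree on every mapping.

```python
# Assumed Python codec equivalents of the charset labels used in the table.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" turns unencodable characters into "?" (0x3f).
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode text in every charset and map each label to its bit strings."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary string, hexadecimal string.
    for label, (binary, hexadecimal) in convert("偲AB").items():
        print(label, binary, hexadecimal)
```

Each byte is formatted as eight binary digits, so the binary string is always four times as long as the hexadecimal string.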