To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 荵溘f繪ャ蠅コ夋マ荵溘f繪ャ蠅コ夋ボ^ 111001001011100110011111111000111000001010000110111000111000100110101100111001011010001010111010111110101001111110000011011111011110010010111001100111111110001110000010100001101110001110001001101011001110010110100010101110101111101010011111100000110111101101011110 e4b99fe38286e389ace5a2bafa9f837de4b99fe38286e389ace5a2bafa9f837b5e
EUC-JP 荵溘f繪ャ蠅コ夋マ荵溘f繪ャ蠅コ夋ボ^ 111010001011101111011110111001011010001111100110111001011110100110001110101011001110101010100100100011101011101010001111101110001110000110100101110111101110100010111011110111101110010110100011111001101110010111101001100011101010110011101010101001001000111010111010100011111011100011100001101001011101110001011110 e8bbdee5a3e6e5e98eaceaa48eba8fb8e1a5dee8bbdee5a3e6e5e98eaceaa48eba8fb8e1a5dc5e
UTF-8 荵溘f繪ャ蠅コ夋マ荵溘f繪ャ蠅コ夋ボ^ 11101000100011011011010111100110101110101001100011101111101111011000011011100111101110011010101011101111101111011010110011101000101000001000010111101111101111011011101011100101101001001000101111100011100000111001111011101000100011011011010111100110101110101001100011101111101111011000011011100111101110011010101011101111101111011010110011101000101000001000010111101111101111011011101011100101101001001000101111100011100000111001110001011110 e88db5e6ba98efbd86e7b9aaefbdace8a085efbdbae5a48be3839ee88db5e6ba98efbd86e7b9aaefbdace8a085efbdbae5a48be3839c5e
UHC ??f繪?蠅??マ??f繪?蠅??ボ^ 001111110011111110100011111001101111110011101011001111111110001110110010001111110011111110101011110111100011111100111111101000111110011011111100111010110011111111100011101100100011111100111111101010111101110001011110 3f3fa3e6fceb3fe3b23f3fabde3f3fa3e6fceb3fe3b23f3fabdc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)