To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???厓э????厓ц????厓э???? 001111110011111100111111111110101000110110000100100011110011111100111111001111110011111111111010100011011000010010001000001111110011111100111111001111111111101010001101100001001000111100111111001111110011111100111111 3f3f3ffa8d848f3f3f3f3ffa8d84883f3f3f3ffa8d848f3f3f3f3f
EUC-JP ???厓э????厓ц????厓э???? 001111110011111100111111100011111011010011000111101001111110111100111111001111110011111100111111100011111011010011000111101001111110100000111111001111110011111100111111100011111011010011000111101001111110111100111111001111110011111100111111 3f3f3f8fb4c7a7ef3f3f3f3f8fb4c7a7e83f3f3f3f8fb4c7a7ef3f3f3f3f
UTF-8 若듸숴厓э푴若듸숴厓ц춶溫볩숴厓э푵若듸숴 111011111010010110110100111010111001001110111000111011001000100010110100111001011000111010010011110100011000110111101101100100011011010011101111101001011011010011101011100100111011100011101100100010001011010011100101100011101001001111010001100001101110110010110110101101101110011010111010101010111110101110110011101010011110110010001000101101001110010110001110100100111101000110001101111011011001000110110101111011111010010110110100111010111001001110111000111011001000100010110100 efa5b4eb93b8ec88b4e58e93d18ded91b4efa5b4eb93b8ec88b4e58e93d186ecb6b6e6baabebb3a9ec88b4e58e93d18ded91b5efa5b4eb93b8ec88b4
UHC 若듸숴厓э푴若듸숴厓ц춶溫볩숴厓э푵若듸숴 111001011010111010110101111011111011110110100100111001001110110110101100111011111011111010000010111001011010111010110101111011111011110110100100111001001110110110101100111010001010110110010010111010001010111010010011111011111011110110100100111001001110110110101100111011111011111010000011111001011010111010110101111011111011110110100100 e5aeb5efbda4e4edacefbe82e5aeb5efbda4e4edace8ad92e8ae93efbda4e4edacefbe83e5aeb5efbda4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)