What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN ???箝????纜D???箝????纜D^ 00111111001111110011111111100010101011010011111100111111001111110011111111100011100111000100010000111111001111110011111111100010101011010011111100111111001111110011111111100011100111000100010001011110 3f3f3fe2ad3f3f3f3fe39c443f3f3fe2ad3f3f3f3fe39c445e
EUC-JP ???箝????纜D???箝????纜D^ 00111111001111110011111111100100101011110011111100111111001111110011111111100101111111000100010000111111001111110011111111100100101011110011111100111111001111110011111111100101111111000100010001011110 3f3f3fe4af3f3f3f3fe5fc443f3f3fe4af3f3f3f3fe5fc445e
UTF-8 렻렜렺箝늚렜렺파纜D렻렜렺箝늚렜렺파纜D^ 111010111010000010111011111010111010000010011100111010111010000010111010111001111010111010011101111010111000101010011010111010111010000010011100111010111010000010111010111011011000110010001100111001111011101010011100010001001110101110100000101110111110101110100000100111001110101110100000101110101110011110101110100111011110101110001010100110101110101110100000100111001110101110100000101110101110110110001100100011001110011110111010100111000100010001011110 eba0bbeba09ceba0bae7ae9deb8a9aeba09ceba0baed8c8ce7ba9c44eba0bbeba09ceba0bae7ae9deb8a9aeba09ceba0baed8c8ce7ba9c445e
UHC 렻렜렺箝늚렜렺파纜D렻렜렺箝늚렜렺파纜D^ 100011101100001110001110101011101000111011000010110011001100010010110100110001011000111010101110100011101100001011000110110001001101010110111111010001001000111011000011100011101010111010001110110000101100110011000100101101001100010110001110101011101000111011000010110001101100010011010101101111110100010001011110 8ec38eae8ec2ccc4b4c58eae8ec2c6c4d5bf448ec38eae8ec2ccc4b4c58eae8ec2c6c4d5bf445e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent is shown as "?" (0x3f) in that row.
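
The conversion shown in the table can be approximated with a minimal Python sketch. This is not the code behind this page; the function name `to_bit_strings` and the `CHARSETS` mapping are hypothetical, and the mapping of the table's charset labels to Python codec names (SJIS-WIN to `cp932`, UHC to `cp949`) is an assumption. Unencodable characters are replaced with "?" (0x3f), which matches the 3f bytes in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" for characters the codec cannot represent.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "파D"  # hypothetical input: one Hangul syllable plus an ASCII letter
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings(sample, codec)
        print(label, sample, binary, hexadecimal)
```

For example, `to_bit_strings("D", "utf-8")` returns `("01000100", "44")`, and the same call with `"iso-8859-1"` on a Hangul character returns the bytes of "?" because ISO-8859-1 has no code point for it.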