To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}?????????{^ 001111110011111100111111001111110011111100111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110111101101011110 3f3f3f3f3f3f3f3f3f7d3f3f3f3f3f3f3f3f3f7b5e
SJIS-WIN 辱?????曜??}辱?????曜??{^ 10010000010010100011111100111111001111110011111100111111100101110110101000111111001111110111110110010000010010100011111100111111001111110011111100111111100101110110101000111111001111110111101101011110 904a3f3f3f3f3f976a3f3f7d904a3f3f3f3f3f976a3f3f7b5e
EUC-JP 辱?????曜??}辱?????曜??{^ 10111111101010110011111100111111001111110011111100111111110011011100101100111111001111110111110110111111101010110011111100111111001111110011111100111111110011011100101100111111001111110111101101011110 bfab3f3f3f3f3fcdcb3f3f7dbfab3f3f3f3f3fcdcb3f3f7b5e
UTF-8 辱잓슆獵뤻킓曜섊쫸}辱잓슆獵뤻킓曜섊쫸{^ 111010001011111010110001111011001001111010010011111011001000101010000110111011111010011010100111111010111010010010111011111011011000001010010011111001101001101110011100111011001000010010001010111011001010101110111000011111011110100010111110101100011110110010011110100100111110110010001010100001101110111110100110101001111110101110100100101110111110110110000010100100111110011010011011100111001110110010000100100010101110110010101011101110000111101101011110 e8beb1ec9e93ec8a86efa6a7eba4bbed8293e69b9cec848aecabb87de8beb1ec9e93ec8a86efa6a7eba4bbed8293e69b9cec848aecabb87b5e
UHC 辱잓슆獵뤻킓曜섊쫸}辱잓슆獵뤻킓曜섊쫸{^ 111010011011010010011111111010011001101010011000111001111010011010001111111010011011010010011111111010001111100010011000111001111010011010001111011111011110100110110100100111111110100110011010100110001110011110100110100011111110100110110100100111111110100011111000100110001110011110100110100011110111101101011110 e9b49fe99a98e7a68fe9b49fe8f898e7a68f7de9b49fe99a98e7a68fe9b49fe8f898e7a68f7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)