What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????n}?????????n{^ 0011111100111111001111110011111100111111001111110011111100111111001111110110111001111101001111110011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 嗚??鳶??橈??n}嗚??鳶??橈??n{^ 1001101001101010001111110011111110010011110011100011111100111111100111101111010000111111001111110110111001111101100110100110101000111111001111111001001111001110001111110011111110011110111101000011111100111111011011100111101101011110 9a6a3f3f93ce3f3f9ef43f3f6e7d9a6a3f3f93ce3f3f9ef43f3f6e7b5e
EUC-JP 嗚??鳶??橈??n}嗚??鳶??橈??n{^ 1101001111001011001111110011111111000110110100000011111100111111110111001111011000111111001111110110111001111101110100111100101100111111001111111100011011010000001111110011111111011100111101100011111100111111011011100111101101011110 d3cb3f3fc6d03f3fdcf63f3f6e7dd3cb3f3fc6d03f3fdcf63f3f6e7b5e
UTF-8 嗚잓쑄鳶껇뮓橈ㅷ땭n}嗚잓쑄鳶껇뮓橈ㅷ땭n{^ 1110010110010111100110101110110010011110100100111110110010010001100001001110100110110011101101101110101010111011100001111110101110101110100100111110011010101001100010001110001110000101101101111110101110010101101011010110111001111101111001011001011110011010111011001001111010010011111011001001000110000100111010011011001110110110111010101011101110000111111010111010111010010011111001101010100110001000111000111000010110110111111010111001010110101101011011100111101101011110 e5979aec9e93ec9184e9b3b6eabb87ebae93e6a988e385b7eb95ad6e7de5979aec9e93ec9184e9b3b6eabb87ebae93e6a988e385b7eb95ad6e7b5e
UHC 嗚잓쑄鳶껇뮓橈ㅷ땭n}嗚잓쑄鳶껇뮓橈ㅷ땭n{^ 1110011111110000100111111110100110011100101001001110011011101001100000111110100010010010100111111110100011111010101001001110011110001011100000110110111001111101111001111111000010011111111010011001110010100100111001101110100110000011111010001001001010011111111010001111101010100100111001111000101110000011011011100111101101011110 e7f09fe99ca4e6e983e8929fe8faa4e78b836e7de7f09fe99ca4e6e983e8929fe8faa4e78b836e7b5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
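
A conversion like the one in the table can be approximated offline. The following is a minimal sketch, not the code behind this page: the function name `to_bit_strings` is hypothetical, and the Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumed equivalents of the charsets listed above. Characters a charset cannot represent are replaced with `?` (hex 3f), which is consistent with the runs of 3f in the table.

```python
def to_bit_strings(text, charsets=("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")):
    """Encode text in each charset and return {charset: (binary, hex)}."""
    rows = {}
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks
        data = text.encode(name, errors="replace")
        # Render every byte as 8 binary digits, most significant bit first
        binary = "".join(f"{byte:08b}" for byte in data)
        rows[name] = (binary, data.hex())
    return rows


# Usage: print one row per charset for a short sample string
for name, (binary, hexadecimal) in to_bit_strings("嗚n").items():
    print(name, binary, hexadecimal)
```

Because each charset maps the same character to a different byte sequence, the binary and hexadecimal columns differ from row to row even though the input is identical.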