To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????on}???????on{^ 001111110011111100111111001111110011111100111111001111110110111101101110011111010011111100111111001111110011111100111111001111110011111101101111011011100111101101011110 3f3f3f3f3f3f3f6f6e7d3f3f3f3f3f3f3f6f6e7b5e
SJIS-WIN 鼎訥?訥?矜┌on}鼎訥?訥?矜┌on{^ 10010011010000111110011001100011001111111110011001100011001111111110000111100000100001001010000101101111011011100111110110010011010000111110011001100011001111111110011001100011001111111110000111100000100001001010000101101111011011100111101101011110 9343e6633fe6633fe1e084a16f6e7d9343e6633fe6633fe1e084a16f6e7b5e
EUC-JP 鼎訥?訥?矜┌on}鼎訥?訥?矜┌on{^ 11000101101001001110101111000100001111111110101111000100001111111110001011100010101010001010001101101111011011100111110111000101101001001110101111000100001111111110101111000100001111111110001011100010101010001010001101101111011011100111101101011110 c5a4ebc43febc43fe2e2a8a36f6e7dc5a4ebc43febc43fe2e2a8a36f6e7b5e
UTF-8 鼎訥렊訥렊矜┌on}鼎訥렊訥렊矜┌on{^ 11101001101111001000111011101000101010001010010111101011101000001000101011101000101010001010010111101011101000001000101011100111100111111001110011100010100101001000110001101111011011100111110111101001101111001000111011101000101010001010010111101011101000001000101011101000101010001010010111101011101000001000101011100111100111111001110011100010100101001000110001101111011011100111101101011110 e9bc8ee8a8a5eba08ae8a8a5eba08ae79f9ce2948c6f6e7de9bc8ee8a8a5eba08ae8a8a5eba08ae79f9ce2948c6f6e7b5e
UHC 鼎訥렊訥렊矜┌on}鼎訥렊訥렊矜┌on{^ 1111000010100011110100101110110110001110101000011101001011101101100011101010000111010000111010001010011010100011011011110110111001111101111100001010001111010010111011011000111010100001110100101110110110001110101000011101000011101000101001101010001101101111011011100111101101011110 f0a3d2ed8ea1d2ed8ea1d0e8a6a36f6e7df0a3d2ed8ea1d2ed8ea1d0e8a6a36f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)