To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣??蹂l????異??碎??沃?(陰 001111110011111100111111100010111000001100111111001111111110011011111000100000101000110000111111001111110011111100111111100010001101100100111111001111111110000111101010001111110011111110010111100000000011111110000001011010011000100101000001 3f3f3f8b833f3fe6f8828c3f3f3f3f88d93f3fe1ea3f3f97803f81698941
EUC-JP ???泣??蹂l????異??碎??沃?(陰 001111110011111100111111101101011110001100111111001111111110110011111010101000111110110000111111001111110011111100111111101100001101101100111111001111111110001011101100001111110011111111001101111000000011111110100001110010101011000110100010 3f3f3fb5e33f3fecfaa3ec3f3f3f3fb0db3f3fe2ec3f3fcde03fa1cab1a2
UTF-8 捻꿔꺂泣붺뙴蹂l떨烈쒕굞異룩첑碎ㅻ깹沃쇰(陰 111011111010011010100100111010101011111110010100111010101011101010000010111001101011001110100011111010111011011010111010111010111001100110110100111010001011100110000010111011111011110110001100111010111001011010101000111011111010011010011111111011001001001010010101111010101011010110011110111001111001010110110000111010111010001110101001111011001011001010010001111001111010001010001110111000111000010110111011111010101011100110111001111001101011001010000011111011001000011110110000111011111011110010001000111010011001100110110000 efa6a4eabf94eaba82e6b3a3ebb6baeb99b4e8b982efbd8ceb96a8efa69fec9295eab59ee795b0eba3a9ecb291e7a28ee385bbeab9b9e6b283ec87b0efbc88e999b0
UHC 捻꿔꺂泣붺뙴蹂l떨烈쒕굞異룩첑碎ㅻ깹沃쇰(陰 1110011011110111101100101110001110000011101010111110101111101000100101001110011110001100101101111110101110110011101000111110110010110110101100111110011011101111100111001110101110000010100001101110110010110110101101111110100010101010100111101110000111101111101001001110101110110010101000011110100010101010101111001110101110100011101010001110101111100100 e6f7b2e383abebe894e78cb7ebb3a3ecb6b3e6ef9ceb8286ecb6b7e8aa9ee1efa4ebb2a1e8aabceba3a8ebe4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)