To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???伊?ぜ惟??孃る?誼??柔ル?沃 00111111001111110011111110001000110010010011111110000010101110101000100011010010001111110011111110011011011011111000001011101001001111111000101101100010001111110011111110001111010111111000001110001011001111111001011110000000 3f3f3f88c93f82ba88d23f3f9b6f82e93f8b623f3f8f5f838b3f9780
EUC-JP ???伊?ぜ惟??孃る?誼??柔ル?沃 00111111001111110011111110110000110010110011111110100100101111001011000011010100001111110011111111010101110100001010010011101011001111111011010111000011001111110011111110111101110000001010010111101011001111111100110111100000 3f3f3fb0cb3fa4bcb0d43f3fd5d0a4eb3fb5c33f3fbdc0a5eb3fcde0
UTF-8 捻뀀맩伊볢ぜ惟듈뵥孃る뿭誼꿨떳柔ル㎣沃 111011111010011010100100111010111000000010000000111010111010011110101001111001001011110010001010111010111011001110100010111000111000000110011100111001101000001110011111111010111001001110001000111010111011010110100101111001011010110110000011111000111000001010001011111010111011111110101101111010001010101010111100111010101011111110101000111010111001011010110011111001101001111110010100111000111000001110101011111000111000111010100011111001101011001010000011 efa6a4eb8080eba7a9e4bc8aebb3a2e3819ce6839feb9388ebb5a5e5ad83e3828bebbfade8aabceabfa8eb96b3e69f94e383abe38ea3e6b283
UHC 捻뀀맩伊볢ぜ惟듈뵥孃る뿭誼꿨떳柔ル㎣沃 1110011011110111101100101110101110010000101100011110110010100101100100111110100010101010101111001110101011101110101101011110001010010100101001001110010110111110101010101110101110010111101011011110101111111110101100101110010110110110101110001110101011110101101010111110101110100111101001111110100010101010 e6f7b2eb90b1eca593e8aabceaeeb5e294a4e5beaaeb97adebfeb2e5b6b8eaf5abeba7a7e8aa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)