To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??を??を??オ??を??ヲ??や◆? 001111110011111110000010111100000011111100111111100000101111000000111111001111111000001101001001001111110011111110000010111100000011111100111111100000111001001000111111001111111000001011100010100000011001111100111111 3f3f82f03f3f82f03f3f83493f3f82f03f3f83923f3f82e2819f3f
EUC-JP ??を??を??オ??を??ヲ??や◆? 001111110011111110100100111100100011111100111111101001001111001000111111001111111010010110101010001111110011111110100100111100100011111100111111101001011111001000111111001111111010010011100100101000101010000100111111 3f3fa4f23f3fa4f23f3fa5aa3f3fa4f23f3fa5f23f3fa4e4a2a13f
UTF-8 룶쥚を룶쥚を룴횕オ룶쥚を룵欄ヲ룶쥚や◆룶 111010111010001110110110111011001010010110011010111000111000001010010010111010111010001110110110111011001010010110011010111000111000001010010010111010111010001110110100111011011001101010010101111000111000001010101010111010111010001110110110111011001010010110011010111000111000001010010010111010111010001110110101111011111010010010011101111000111000001110110010111010111010001110110110111011001010010110011010111000111000001010000100111000101001011110000110111010111010001110110110 eba3b6eca59ae38292eba3b6eca59ae38292eba3b4ed9a95e382aaeba3b6eca59ae38292eba3b5efa49de383b2eba3b6eca59ae38284e29786eba3b6
UHC 룶쥚を룶쥚を룴횕オ룶쥚を룵欄ヲ룶쥚や◆룶 10001111101010111010001010001111101010101111001010001111101010111010001010001111101010101111001010001111101010011100001110001111101010111010101010001111101010111010001010001111101010101111001010001111101010101101000111101101101010111111001010001111101010111010001010001111101010101110010010100001110111111000111110101011 8faba28faaf28faba28faaf28fa9c38fabaa8faba28faaf28faad1edabf28faba28faae4a1df8fab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)