To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????N}?????????N{^ 0011111100111111001111110011111100111111001111110011111100111111001111110100111001111101001111110011111100111111001111110011111100111111001111110011111100111111010011100111101101011110 3f3f3f3f3f3f3f3f3f4e7d3f3f3f3f3f3f3f3f3f4e7b5e
SJIS-WIN ??ⅴ???曄η?N}??ⅴ???曄η?N{^ 0011111100111111111110100100010000111111001111110011111110011110010000001000001111000101001111110100111001111101001111110011111111111010010001000011111100111111001111111001111001000000100000111100010100111111010011100111101101011110 3f3ffa443f3f3f9e4083c53f4e7d3f3ffa443f3f3f9e4083c53f4e7b5e
EUC-JP ??????曄η?N}??????曄η?N{^ 001111110011111100111111001111110011111100111111110110111010000110100110110001110011111101001110011111010011111100111111001111110011111100111111001111111101101110100001101001101100011100111111010011100111101101011110 3f3f3f3f3f3fdba1a6c73f4e7d3f3f3f3f3f3fdba1a6c73f4e7b5e
UTF-8 女사ⅴ列쀦왊曄η떣N}女사ⅴ列쀦왊曄η떣N{^ 111011111010011010000001111011001000001010101100111000101000010110110100111011111010011010011100111011001000000010100110111011001001100110001010111001101001101110000100110011101011011111101011100101101010001101001110011111011110111110100110100000011110110010000010101011001110001010000101101101001110111110100110100111001110110010000000101001101110110010011001100010101110011010011011100001001100111010110111111010111001011010100011010011100111101101011110 efa681ec82ace285b4efa69cec80a6ec998ae69b84ceb7eb96a34e7defa681ec82ace285b4efa69cec80a6ec998ae69b84ceb7eb96a34e7b5e
UHC 女사ⅴ列쀦왊曄η떣N}女사ⅴ列쀦왊曄η떣N{^ 1110010111111100101110111110011110100101101001011110011011101010100101111110011010011110101110111110011110100101101001011110011110001011101101110100111001111101111001011111110010111011111001111010010110100101111001101110101010010111111001101001111010111011111001111010010110100101111001111000101110110111010011100111101101011110 e5fcbbe7a5a5e6ea97e69ebbe7a5a5e78bb74e7de5fcbbe7a5a5e6ea97e69ebbe7a5a5e78bb74e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)