To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???nR???n^[???nR???n^[^ 0011111100111111001111110110111001010010001111110011111100111111011011100101111001011011001111110011111100111111011011100101001000111111001111110011111101101110010111100101101101011110 3f3f3f6e523f3f3f6e5e5b3f3f3f6e523f3f3f6e5e5b5e
SJIS-WIN 烏k?nR烏k?n^[烏k?nR烏k?n^[^ 10001001010001111000001010001011001111110110111001010010100010010100011110000010100010110011111101101110010111100101101110001001010001111000001010001011001111110110111001010010100010010100011110000010100010110011111101101110010111100101101101011110 8947828b3f6e528947828b3f6e5e5b8947828b3f6e528947828b3f6e5e5b5e
EUC-JP 烏k?nR烏k?n^[烏k?nR烏k?n^[^ 10110001101010001010001111101011001111110110111001010010101100011010100010100011111010110011111101101110010111100101101110110001101010001010001111101011001111110110111001010010101100011010100010100011111010110011111101101110010111100101101101011110 b1a8a3eb3f6e52b1a8a3eb3f6e5e5bb1a8a3eb3f6e52b1a8a3eb3f6e5e5b5e
UTF-8 烏k젌nR烏k젌n^[烏k젌nR烏k젌n^[^ 1110011110000011100011111110111110111101100010111110110010100000100011000110111001010010111001111000001110001111111011111011110110001011111011001010000010001100011011100101111001011011111001111000001110001111111011111011110110001011111011001010000010001100011011100101001011100111100000111000111111101111101111011000101111101100101000001000110001101110010111100101101101011110 e7838fefbd8beca08c6e52e7838fefbd8beca08c6e5e5be7838fefbd8beca08c6e52e7838fefbd8beca08c6e5e5b5e
UHC 烏k젌nR烏k젌n^[烏k젌nR烏k젌n^[^ 1110100010100001101000111110101110100000100011010110111001010010111010001010000110100011111010111010000010001101011011100101111001011011111010001010000110100011111010111010000010001101011011100101001011101000101000011010001111101011101000001000110101101110010111100101101101011110 e8a1a3eba08d6e52e8a1a3eba08d6e5e5be8a1a3eba08d6e52e8a1a3eba08d6e5e5b5e

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
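
The conversion shown in the table can be approximated with a short Python sketch. This is not the converter's own implementation. The function name encode_to_bits and the CODECS mapping are hypothetical, and the codec choices are assumptions: SJIS-WIN is mapped to Python's cp932 (as the note above suggests) and UHC to cp949. Characters a charset cannot represent are replaced with "?", which matches the 3f bytes in the table. The sample string is the first five characters of the table input, written with escapes; the second character is taken to be U+FF4B (fullwidth k), since the UTF-8 row encodes it as ef bd 8b.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932, per the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's cp949 codec
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


if __name__ == "__main__":
    # U+70CF, U+FF4B, U+C80C followed by ASCII "nR"
    sample = "\u70cf\uff4b\uc80cnR"
    for label, codec in CODECS.items():
        binary, hexadecimal = encode_to_bits(sample, codec)
        print(label, binary, hexadecimal)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.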