To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????[??????????[^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011011001111110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN ????猥????猥[????猥????猥[^ 001111110011111100111111001111111110000011001110001111110011111100111111001111111110000011001110010110110011111100111111001111110011111111100000110011100011111100111111001111110011111111100000110011100101101101011110 3f3f3f3fe0ce3f3f3f3fe0ce5b3f3f3f3fe0ce3f3f3f3fe0ce5b5e
EUC-JP ????猥????猥[????猥????猥[^ 001111110011111100111111001111111110000011010000001111110011111100111111001111111110000011010000010110110011111100111111001111110011111111100000110100000011111100111111001111110011111111100000110100000101101101011110 3f3f3f3fe0d03f3f3f3fe0d05b3f3f3f3fe0d03f3f3f3fe0d05b5e
UTF-8 렯롏렯렞猥렯롏렯렞猥[렯롏렯렞猥렯롏렯렞猥[^ 111010111010000010101111111010111010000110001111111010111010000010101111111010111010000010011110111001111000110010100101111010111010000010101111111010111010000110001111111010111010000010101111111010111010000010011110111001111000110010100101010110111110101110100000101011111110101110100001100011111110101110100000101011111110101110100000100111101110011110001100101001011110101110100000101011111110101110100001100011111110101110100000101011111110101110100000100111101110011110001100101001010101101101011110 eba0afeba18feba0afeba09ee78ca5eba0afeba18feba0afeba09ee78ca55beba0afeba18feba0afeba09ee78ca5eba0afeba18feba0afeba09ee78ca55b5e
UHC 렯롏렯렞猥렯롏렯렞猥[렯롏렯렞猥렯롏렯렞猥[^ 10001110101111001000111011010101100011101011110010001110101011111110100011100101100011101011110010001110110101011000111010111100100011101010111111101000111001010101101110001110101111001000111011010101100011101011110010001110101011111110100011100101100011101011110010001110110101011000111010111100100011101010111111101000111001010101101101011110 8ebc8ed58ebc8eafe8e58ebc8ed58ebc8eafe8e55b8ebc8ed58ebc8eafe8e58ebc8ed58ebc8eafe8e55b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)