To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????D?????????D^ 001111110011111100111111001111110011111100111111001111110011111100111111010001000011111100111111001111110011111100111111001111110011111100111111001111110100010001011110 3f3f3f3f3f3f3f3f3f443f3f3f3f3f3f3f3f3f445e
SJIS-WIN 曜?????幽??D曜?????幽??D^ 10010111011010100011111100111111001111110011111100111111100101110100100000111111001111110100010010010111011010100011111100111111001111110011111100111111100101110100100000111111001111110100010001011110 976a3f3f3f3f3f97483f3f44976a3f3f3f3f3f97483f3f445e
EUC-JP 曜?????幽??D曜?????幽??D^ 11001101110010110011111100111111001111110011111100111111110011011010100100111111001111110100010011001101110010110011111100111111001111110011111100111111110011011010100100111111001111110100010001011110 cdcb3f3f3f3f3fcda93f3f44cdcb3f3f3f3f3fcda93f3f445e
UTF-8 曜섎끁溜뗦뎬幽귣쐪D曜섎끁溜뗦뎬幽귣쐪D^ 111001101001101110011100111011001000010010001110111010111000000110000001111011111010011110001011111010111001011110100110111010111000111010101100111001011011100110111101111010101011011110100011111011001001000010101010010001001110011010011011100111001110110010000100100011101110101110000001100000011110111110100111100010111110101110010111101001101110101110001110101011001110010110111001101111011110101010110111101000111110110010010000101010100100010001011110 e69b9cec848eeb8181efa78beb97a6eb8eace5b9bdeab7a3ec90aa44e69b9cec848eeb8181efa78beb97a6eb8eace5b9bdeab7a3ec90aa445e
UHC 曜섎끁溜뗦뎬幽귣쐪D曜섎끁溜뗦뎬幽귣쐪D^ 111010001111100010011000111010111000010110110111111010101111111010001011111001101011010110110100111010101110101110000010111010111001110010001111010001001110100011111000100110001110101110000101101101111110101011111110100010111110011010110101101101001110101011101011100000101110101110011100100011110100010001011110 e8f898eb85b7eafe8be6b5b4eaeb82eb9c8f44e8f898eb85b7eafe8be6b5b4eaeb82eb9c8f445e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)