What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 癲?????袁k?嚥△?急乳??循??^ 11100001100111110011111100111111001111110011111100111111111001011100110110000010100010110011111110011010100010111000000110100010001111111000101101111101100100111111101100111111001111111000111101111010001111110011111101011110 e19f3f3f3f3f3fe5cd828b3f9a8b81a23f8b7d93fb3f3f8f7a3f3f5e
EUC-JP 癲??堉??袁k?嚥△?急乳??循??^ 111000101010000100111111001111111000111110110111111111010011111100111111111010101100111110100011111010110011111111010011111010111010001010100100001111111011010111011110110001101111110100111111001111111011110111011011001111110011111101011110 e2a13f3f8fb7fd3f3feacfa3eb3fd3eba2a43fb5dec6fd3f3fbddb3f3f5e
UTF-8 癲욌맩堉낂솻袁k뮅嚥△뱭急乳뜻깱循뗪틮^ 11100111100110011011001011101100100110101000110011101011101001111010100111100101101000001000100111101011100000101000001011101100100001101011101111101000101000101000000111101111101111011000101111101011101011101000010111100101100110101010010111100010100101101011001111101011101100011010110111100110100000001010010111100100101110011011001111101011100111001011101111101010101110011011000111100101101111101010101011101011100101111010101011101101100010111010111001011110 e799b2ec9a8ceba7a9e5a089eb8282ec86bbe8a281efbd8bebae85e59aa5e296b3ebb1ade680a5e4b9b3eb9cbbeab9b1e5beaaeb97aaed8bae5e
UHC 癲욌맩堉낂솻袁k뮅嚥△뱭急乳뜻깱循뗪틮^ 111011111010011010011110111010111001000010110001111010111011110010000101111010011001100110110000111010101011111010100011111010111001001010010100111001101011111110100001111000101001001110010011110100001110000111101010111000011011011011100110100000111001111111100010111000001000101111101010101110101001100001011110 efa69eeb90b1ebbc85e999b0eabea3eb9294e6bfa1e29393d0e1eae1b6e6839fe2e08beaba985e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that cannot be represented in a given charset is shown as "?" (byte 3f).
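
The same kind of conversion can be roughly reproduced offline. The following is a minimal Python sketch, not this page's actual implementation. The codec names are assumptions: Python's `cp932` stands in for SJIS-WIN and `cp949` for UHC, and their mapping tables may differ from this tool's in a few edge cases. The `encode_table` helper and the `CHARSETS` mapping are hypothetical names introduced only for illustration.

```python
# Assumed mapping from the labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 approximates SJIS-WIN
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 approximates UHC
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("あ^"):
        print(label, binary, hexadecimal)
```

For the input "あ^", ISO-8859-1 cannot represent "あ", so that row begins with 3f, while the UTF-8 row begins with the three-byte sequence e3 81 82.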