What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ???鈺??獄??玉?????鈺??獄??玉??^ 00111111001111110011111111111011110001000011111100111111100011011001011000111111001111111000101111001010001111110011111100111111001111110011111111111011110001000011111100111111100011011001011000111111001111111000101111001010001111110011111101011110 3f3f3ffbc43f3f8d963f3f8bca3f3f3f3f3ffbc43f3f8d963f3f8bca3f3f5e
EUC-JP 旿??鈺??獄??玉??旿??鈺??獄??玉??^ 10001111110000011111010000111111001111111000111111100011110101010011111100111111101110011111011000111111001111111011011011001100001111110011111110001111110000011111010000111111001111111000111111100011110101010011111100111111101110011111011000111111001111111011011011001100001111110011111101011110 8fc1f43f3f8fe3d53f3fb9f63f3fb6cc3f3f8fc1f43f3f8fe3d53f3fb9f63f3fb6cc3f3f5e
UTF-8 旿딉슁鈺뚳쉐獄깍쉴玉먨렚旿딉슁鈺뚳쉐獄깍쉴玉먬뼯^ 11100110100101111011111111101011100101001000100111101100100010101000000111101001100010001011101011101011100110101011001111101100100010011001000011100111100011011000010011101010101110011000110111101100100010011011010011100111100011101000100111101011101010001010100011101011101000001001101011100110100101111011111111101011100101001000100111101100100010101000000111101001100010001011101011101011100110101011001111101100100010011001000011100111100011011000010011101010101110011000110111101100100010011011010011100111100011101000100111101011101010001010110011101011101111001010111101011110 e697bfeb9489ec8a81e988baeb9ab3ec8990e78d84eab98dec89b4e78e89eba8a8eba09ae697bfeb9489ec8a81e988baeb9ab3ec8990e78d84eab98dec89b4e78e89eba8acebbcaf5e
UHC 旿딉슁鈺뚳쉐獄깍쉴玉먨렚旿딉슁鈺뚳쉐獄깍쉴玉먬뼯^ 11100111111110101000101011101111101111011011001111101000101011011000110011101111101111011010011011101000101010111011000111101111101111011010111111101000101011001001000011100101100011101010110111100111111110101000101011101111101111011011001111101000101011011000110011101111101111011010011011101000101010111011000111101111101111011010111111101000101011001001000011101001100101101011001001011110 e7fa8aefbdb3e8ad8cefbda6e8abb1efbdafe8ac90e58eade7fa8aefbdb3e8ad8cefbda6e8abb1efbdafe8ac90e996b25e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
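
The same kind of conversion can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `encode_table` and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumptions, and Python's codecs may differ from this tool in edge cases. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the runs of 3f in the table.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-Win": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("日^"):
        print(label, bits, hex_string)
```

Each row carries the charset label, the encoded bytes written as a bit string, and the same bytes in hexadecimal, mirroring the last two columns of the table.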