To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊??幽??鈺??援?┐誘??筌??? 0011111100111111001111111110001010000110001111110011111110010111010010000011111100111111111110111100010000111111001111111000100110000111001111111000010010100010100101110101010100111111001111111110001010100011001111110011111100111111 3f3f3fe2863f3f97483f3ffbc43f3f89873f84a297553f3fe2a33f3f3f
EUC-JP ???竊??幽??鈺??援?┐誘??筌??? 001111110011111100111111111000111110011000111111001111111100110110101001001111110011111110001111111000111101010100111111001111111011000111100111001111111010100010100100110011011011011000111111001111111110010010100101001111110011111100111111 3f3f3fe3e63f3fcda93f3f8fe3d53f3fb1e73fa8a4cdb63f3fe4a53f3f3f
UTF-8 捻뀁뮆竊섉꼷幽귦맪鈺곗뼦援ㅿ┐誘↔틓筌뗢넁吏 111011111010011010100100111010111000000010000001111010111010111010000110111001111010101110001010111011001000010010001001111010101011110010110111111001011011100110111101111010101011011110100110111010111010011110101010111010011000100010111010111010101011001110010111111010111011110010100110111001101000111110110100111000111000010110111111111000101001010010010000111010001010101010011000111000101000011010010100111011011000101110010011111001111010110110001100111010111001011110100010111010111000010010000001111011111010011110011110 efa6a4eb8081ebae86e7ab8aec8489eabcb7e5b9bdeab7a6eba7aae988baeab397ebbca6e68fb4e385bfe29490e8aa98e28694ed8b93e7ad8ceb97a2eb8481efa79e
UHC 捻뀁뮆竊섉꼷幽귦맪鈺곗뼦援ㅿ┐誘↔틓筌뗢넁吏 1110011011110111101100101110110010010010100101011110111110111100100110001110011010000100100011111110101011101011100000101110110110010000101100101110100010101101101100001110110010010110101010011110101010110101101001001110111110100110101001001110101110101111101000011110101010111010100000101110111110100111100010111110001010000110100100011110110010100111 e6f7b2ec9295efbc98e6848feaeb82ed90b2e8adb0ec96a9eab5a4efa6a4ebafa1eaba82efa78be28691eca7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)