Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???乙??耶??? 001111110011111100111111100010011011001100111111001111111001011011101011001111110011111100111111 3f3f3f89b33f3f96eb3f3f3f
EUC-JP ???乙??耶??? 001111110011111100111111101100101011010100111111001111111100110011101101001111110011111100111111 3f3f3fb2b53f3fcced3f3f3f
UTF-8 捻뀁뫑乙대탿耶쇰쉴劉 111011111010011010100100111010111000000010000001111010111010101110010001111001001011100110011001111010111000110010000000111011011000001110111111111010001000000010110110111011001000011110110000111011001000100110110100111011111010011110000111 efa6a4eb8081ebab91e4b999eb8c80ed83bfe880b6ec87b0ec89b4efa787
UHC 捻뀁뫑乙대탿耶쇰쉴劉 1110011011110111101100101110110010010001101100111110101111100000101101001110101110110101100110111110010110101101101111001110101110111101101011111110101011100101 e6f7b2ec91b3ebe0b4ebb59be5adbcebbdafeae5

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
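
The rows in the table can be reproduced approximately with a few lines of Python. This is only a sketch, not the code behind the page: the function name encode_row and the CHARSETS mapping are made up for illustration, and the mapping of the table's labels to Python codec names (SJIS-WIN to cp932, UHC to cp949) is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 3f), which matches what the table shows for ISO-8859-1.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note above says
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949 codec
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" instead of raising an error.
    raw = text.encode(codec, errors="replace")
    # Decode again to see which characters survived the round trip.
    shown = raw.decode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return shown, bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        shown, bits, hexed = encode_row("乙", codec)
        print(label, shown, bits, hexed)
```

For the single character 乙, the hexadecimal values agree with the corresponding bytes in the table: 89b3 in SJIS-WIN, b2b5 in EUC-JP, and e4b999 in UTF-8.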