To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲ル????喩?????誼?┐誘??娃 1110000110011111100000111000101100111111001111110011111100111111100110100110011100111111001111110011111100111111001111111000101101100010001111111000010010100010100101110101010100111111001111111000100010100001 e19f838b3f3f3f3f9a673f3f3f3f3f8b623f84a297553f3f88a1
EUC-JP 癲ル?佾??喩?????誼?┐誘??娃 11100010101000011010010111101011001111111000111110110000111110110011111100111111110100111100100000111111001111110011111100111111001111111011010111000011001111111010100010100100110011011011011000111111001111111011000010100011 e2a1a5eb3f8fb0fb3f3fd3c83f3f3f3f3fb5c33fa8a4cdb63f3fb0a3
UTF-8 癲ル슡佾붹쾮喩믩뙀列욧퍓誼욑┐誘↔뭅娃 111001111001100110110010111000111000001110101011111011001000101010100001111001001011110110111110111010111011011010111001111011001011111010101110111001011001011010101001111010111010111110101001111010111001100110000000111011111010011010011100111011001001101010100111111011011000110110010011111010001010101010111100111011001001101010010001111000101001010010010000111010001010101010011000111000101000011010010100111010111010110110000101111001011010100010000011 e799b2e383abec8aa1e4bdbeebb6b9ecbeaee596a9ebafa9eb9980efa69cec9aa7ed8d93e8aabcec9a91e29490e8aa98e28694ebad85e5a883
UHC 癲ル슡佾붹쾮喩믩뙀列욧퍓誼욑┐誘↔뭅娃 1110111110100110101010111110101110011010101011011110110011101011100101001110011010110010100001011110101011100111100100101110101110001100100001101110011011101010101111111110101010111011100010101110101111111110100111101110111110100110101001001110101110101111101000011110101010111001101101001110100011011111 efa6abeb9aadeceb94e6b285eae792eb8c86e6eabfeabb8aebfe9eefa6a4ebafa1eab9b4e8df

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)