What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲??洵→?癒??馭??矣??獄??爾 111000011001111100111111001111111001111110101011100000011010100000111111100101101111110000111111001111111110100101100110001111110011111111100001111000010011111100111111100011011001011000111111001111111000111010100010 e19f3f3f9fab81a83f96fc3f3fe9663f3fe1e13f3f8d963f3f8ea2
EUC-JP 癲??洵→?癒??馭??矣??獄??爾 111000101010000100111111001111111101111010101101101000101010101000111111110011001111111000111111001111111111000111000111001111110011111111100010111000110011111100111111101110011111011000111111001111111011110010100100 e2a13f3fdeada2aa3fccfe3f3ff1c73f3fe2e33f3fb9f63f3fbca4
UTF-8 癲숈슜洵→끽癒⑸옜馭앮룚矣뺣퓱獄쏆옓爾 111001111001100110110010111011001000100010001000111011001000101010011100111001101011010010110101111000101000011010010010111010111000000110111101111001111001100110010010111000101001000110111000111011001001100010011100111010011010011010101101111011001001010110101110111010111010001110011010111001111001111110100011111010111011101010100011111011011001001110110001111001111000110110000100111011001000111110000110111011001001100010010011111001111000100010111110 e799b2ec8888ec8a9ce6b4b5e28692eb81bde79992e291b8ec989ce9a6adec95aeeba39ae79fa3ebbaa3ed93b1e78d84ec8f86ec9893e788be
UHC 癲숈슜洵→끽癒⑸옜馭앮룚矣뺣퓱獄쏆옓爾 1110111110100110100110011110110010011010101010011110001011100111101000011110011010110011101000111110101110101000101010011110101110111111101111111110010111011111100111011110011010001111100101101110101111111000100101011110101110111111100101111110100010101011100110111110110010011110100110011110110010110011 efa699ec9aa9e2e7a1e6b3a3eba8a9ebbfbfe5df9de68f96ebf895ebbf97e8ab9bec9e99ecb3

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
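
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name encode_table and the codec names are assumptions: Python's cp932 and cp949 codecs are used here as stand-ins for SJIS-Win and UHC and may differ from this tool's mappings for a few characters. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the rows above.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: closest Python codec to SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's name for Unified Hangul Code
}


def encode_table(text):
    """Return one (label, decoded text, binary string, hex string) row per charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((label, data.decode(codec), bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, shown, bits, hexa in encode_table("あ"):
        print(label, shown, bits, hexa)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column.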