Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ????????????????ル?淹??B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111000001110001011001111111001111110111001001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f838b3f9fb93f3f42
EUC-JP ????????????????ル?淹??B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111111010010111101011001111111101111010111011001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fa5eb3fdebb3f3f42
UTF-8 溜삳젗溜삳젷溜뷸뼡溜깅졎溜볥졎溜ル졋淹ㅻ젿B 11101111101001111000101111101100100000101011001111101100101000001001011111101111101001111000101111101100100000101011001111101100101000001011011111101111101001111000101111101011101101111011100011101011101111001010000111101111101001111000101111101010101110011000010111101100101000011000111011101111101001111000101111101011101100111010010111101100101000011000111011101111101001111000101111100011100000111010101111101100101000011000101111100110101101111011100111100011100001011011101111101100101000001011111101000010 efa78bec82b3eca097efa78bec82b3eca0b7efa78bebb7b8ebbca1efa78beab985eca18eefa78bebb3a5eca18eefa78be383abeca18be6b7b9e385bbeca0bf42
UHC 溜삳젗溜삳젷溜뷸뼡溜깅졎溜볥졎溜ル졋淹ㅻ젿B 11101010111111101011101111101011101000001001001111101010111111101011101111101011101000001010101111101010111111101011101011100110100101101010010011101010111111101011000111101011101000001011101111101010111111101001001111101011101000001011101111101010111111101010101111101011101000001011101011100101111101001010010011101011101000001011000101000010 eafebbeba093eafebbeba0abeafebae696a4eafeb1eba0bbeafe93eba0bbeafeabeba0bae5f4a4eba0b142

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
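
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the page's own implementation (which is not shown here). The function names `encode_to_bits` and `conversion_table` are hypothetical, and the mapping from the page's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption. Characters that a charset cannot represent are replaced with "?" (byte 0x3f), which is one plausible way to end up with runs of 3f like those in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become "?" (0x3f) instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def conversion_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_to_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset: label, binary bit string, hexadecimal bit string.
    for label, (binary, hexadecimal) in conversion_table("ルB").items():
        print(label, binary, hexadecimal)
```

Calling `encode_to_bits("B", "latin-1")` gives the bit string `01000010` and the hex string `42`, matching the final byte of every row in the table.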