To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8柚??宥?????日??猷???λ? 11100001100111110011111110000010010101111001011101001101001111110011111110010111010001110011111100111111001111110011111100111111100100111111101000111111001111111001011101010001001111110011111100111111100000111100100100111111 e19f3f8257974d3f3f97473f3f3f3f3f93fa3f3f97513f3f3f83c93f
EUC-JP 癲?8柚??宥?????日??猷???λ? 11100010101000010011111110100011101110001100110110101110001111110011111111001101101010000011111100111111001111110011111100111111110001101111110000111111001111111100110110110010001111110011111100111111101001101100101100111111 e2a13fa3b8cdae3f3fcda83f3f3f3f3fc6fc3f3fcdb23f3f3fa6cb3f
UTF-8 癲쒕8柚삯뜮宥멸콟銳얜㉡日뗩툣猷몃괭若λ푶 1110011110011001101100101110110010010010100101011110111110111100100110001110011010011111100110101110110010000010101011111110101110011100101011101110010110101110101001011110101110101001101110001110110010111101100111111110100110001010101100111110110010010110100111001110001110001001101000011110011010010111101001011110101110010111101010011110110110001000101000111110011110001100101101111110101110101010100000111110101010110100101011011110111110100101101101001100111010111011111011011001000110110110 e799b2ec9295efbc98e69f9aec82afeb9caee5aea5eba9b8ecbd9fe98ab3ec969ce389a1e697a5eb97a9ed88a3e78cb7ebaa83eab4adefa5b4cebbed91b6
UHC 癲쒕8柚삯뜮宥멸콟銳얜㉡日뗩툣猷몃괭若λ푶 111011111010011010011100111010111010001110111000111010101111011010111011111010011000110110101110111010101110100110111000111010101011000110010111111001111110010110111110111010111010100010110010111011001110110110001011111010011011100010011010111010111010001110111000111010111011000110101010111001011010111010100101111010111011111010000100 efa69ceba3b8eaf6bbe98daeeae9b8eab197e7e5beeba8b2eced8be9b89aeba3b8ebb1aae5aea5ebbe84

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)