To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻??毅?.違??齬??悠?い怨λ?? 11100100111010000011111100111111100010110100001000111111100000010100010010001000111000010011111100111111111010101001011100111111001111111001011101001001001111111000001010100010100010011000010110000011110010010011111100111111 e4e83f3f8b423f814488e13f3fea973f3f97493f82a2898583c93f3f
EUC-JP 蒻??毅?.違??齬??悠?い怨λ?? 11101000111010100011111100111111101101011010001100111111101000011010010110110000111000110011111100111111111100111111011100111111001111111100110110101010001111111010010010100100101100011110010110100110110010110011111100111111 e8ea3f3fb5a33fa1a5b0e33f3ff3f73f3fcdaa3fa4a4b1e5a6cb3f3f
UTF-8 蒻몃뿭毅싮.違먥뵺齬잙벊悠뽬い怨λ걦力 1110100010010010101110111110101110101010100000111110101110111111101011011110011010101111100001011110110010001011101011101110111110111100100011101110100110000001100101011110101110101000101001011110101110110101101110101110100110111101101011001110110010011110100110011110101110110010100010101110011010000010101000001110101110111101101011001110001110000001100001001110011010000000101010001100111010111011111010101011000110100110111011111010011010001010 e892bbebaa83ebbfade6af85ec8baeefbc8ee98195eba8a5ebb5bae9bdacec9e99ebb28ae682a0ebbdace38184e680a8cebbeab1a6efa68a
UHC 蒻몃뿭毅싮.違먥뵺齬잙벊悠뽬い怨λ걦力 1110010110110110101110001110101110010111101011011110101111110110100110101110100110100011101011101110101011011110100100001110001010010100101110001110010111100001100111111110101110010011101011011110101011101101100101101110100010101010101001001110101010110011101001011110101110000001100011111110011010110011 e5b6b8eb97adebf69ae9a3aeeade90e294b8e5e19feb93adeaed96e8aaa4eab3a5eb818fe6b3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)