To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯??誼ワⅤ怨??辱??誼??醫??壓??誼 1110100111110010001111110011111110001011011000101000001110001111100001110101100010001001100001010011111100111111100100000100101000111111001111111000101101100010001111110011111111100111110011100011111100111111100110101101100000111111001111111000101101100010 e9f23f3f8b62838f875889853f3f904a3f3f8b623f3fe7ce3f3f9ad83f3f8b62
EUC-JP 鶯??誼ワ?怨??辱??誼??醫??壓??誼 11110010111101000011111100111111101101011100001110100101111011110011111110110001111001010011111100111111101111111010101100111111001111111011010111000011001111110011111111101110110100000011111100111111110101001101101000111111001111111011010111000011 f2f43f3fb5c3a5ef3fb1e53f3fbfab3f3fb5c33f3feed03f3fd4da3f3fb5c3
UTF-8 鶯ㅳ꺂誼ワⅤ怨뺤젲辱됰봾誼뚳쫱醫딆젵壓꾨ㅏ誼 111010011011011010101111111000111000010110110011111010101011101010000010111010001010101010111100111000111000001110101111111000101000010110100100111001101000000010101000111010111011101010100100111011001010000010110010111010001011111010110001111010111001000010110000111010111011010010111110111010001010101010111100111010111001101010110011111011001010101110110001111010011000011010101011111010111001010010000110111011001010000010110101111001011010001110010011111010101011111010101000111000111000010110001111111010001010101010111100 e9b6afe385b3eaba82e8aabce383afe285a4e680a8ebbaa4eca0b2e8beb1eb90b0ebb4bee8aabceb9ab3ecabb1e986abeb9486eca0b5e5a393eabea8e3858fe8aabc
UHC 鶯ㅳ꺂誼ワⅤ怨뺤젲辱됰봾誼뚳쫱醫딆젵壓꾨ㅏ誼 1110010110100011101001001110001110000011101010111110101111111110101010111110111110100101101101001110101010110011100101011110110010100000101001101110100110110100100010011110101110010100100001011110101111111110100011001110111110100110100010011110110010100010100010101110110010100000101010011110010011100010100001001110101110100100101111111110101111111110 e5a3a4e383abebfeabefa5b4eab395eca0a6e9b489eb9485ebfe8cefa689eca28aeca0a9e4e284eba4bfebfe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)