To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 厄β???ぜ??ザ娃??竊??柔j?? 100101101110111110000011110000000011111100111111001111111000001010111010001111110011111110000011010101011000100010100001001111110011111111100010100001100011111100111111100011110101111110000010100010100011111100111111 96ef83c03f3f3f82ba3f3f835588a13f3fe2863f3f8f5f828a3f3f
EUC-JP 厄β?佾?ぜ??ザ娃??竊??柔j?? 1100110011110001101001101100001000111111100011111011000011111011001111111010010010111100001111110011111110100101101101101011000010100011001111110011111111100011111001100011111100111111101111011100000010100011111010100011111100111111 ccf1a6c23f8fb0fb3fa4bc3f3fa5b6b0a33f3fe3e63f3fbdc0a3ea3f3f
UTF-8 厄β넄佾볢ぜ琉꾩ザ娃븐뼚竊덄춯柔j틓廬 1110010110001110100001001100111010110010111010111000010010000100111001001011110110111110111010111011001110100010111000111000000110011100111011111010011110001100111010101011111010101001111000111000001010110110111001011010100010000011111010111011100010010000111010111011110010011010111001111010101110001010111010111000110110000100111011001011011010101111111001101001111110010100111011111011110110001010111011011000101110010011111011111010011010000010 e58e84ceb2eb8484e4bdbeebb3a2e3819cefa78ceabea9e382b6e5a883ebb890ebbc9ae7ab8aeb8d84ecb6afe69f94efbd8aed8b93efa682
UHC 厄β넄佾볢ぜ琉꾩ザ娃븐뼚竊덄춯柔j틓廬 1110010011111000101001011110001010000110100101001110110011101011100100111110100010101010101111001110101110100100100001001110110010101011101101101110100011011111101110101110110010010110101000001110111110111100100010001110011110101101100011001110101011110101101000111110101010111010100000101110010111111110 e4f8a5e28694eceb93e8aabceba484ecabb6e8dfbaec96a0efbc88e7ad8ceaf5a3eaba82e5fe

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)