To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 俉???ゅ?瘟?????俉???ゅ?節?????^ 11111010011000010011111100111111001111111000001011100011001111111110000110001001001111110011111100111111001111110011111111111010011000010011111100111111001111111000001011100011001111111001000011011111001111110011111100111111001111110011111101011110 fa613f3f3f82e33fe1893f3f3f3f3ffa613f3f3f82e33f90df3f3f3f3f3f5e
EUC-JP 俉???ゅ?瘟?????俉???ゅ?節?????^ 100011111011000110111011001111110011111100111111101001001110010100111111111000011110100100111111001111110011111100111111001111111000111110110001101110110011111100111111001111111010010011100101001111111100000011100001001111110011111100111111001111110011111101011110 8fb1bb3f3f3fa4e53fe1e93f3f3f3f3f8fb1bb3f3f3fa4e53fc0e13f3f3f3f3f5e
UTF-8 俉드뫕濾ゅ쪢瘟욥썚娛숃찠俉드뫕濾ゅ쪤節곈썚娛숃눛^ 11100100101111111000100111101011100100111001110011101011101010111001010111101111101001101000010011100011100000101000010111101100101010101010001011100111100110001001111111101100100110101010010111101100100011011001101011100101101010001001101111101100100010001000001111101100101100001010000011100100101111111000100111101011100100111001110011101011101010111001010111101111101001101000010011100011100000101000010111101100101010101010010011100111101011111000000011101010101100111000100011101100100011011001101011100101101010001001101111101100100010001000001111101011100010001001101101011110 e4bf89eb939cebab95efa684e38285ecaaa2e7989fec9aa5ec8d9ae5a89bec8883ecb0a0e4bf89eb939cebab95efa684e38285ecaaa4e7af80eab388ec8d9ae5a89bec8883eb889b5e
UHC 俉드뫕濾ゅ쪢瘟욥썚娛숃찠俉드뫕濾ゅ쪤節곈썚娛숃눛^ 11100111111010111011010111100101100100011011011111100110101001001010101011100101101001011001101111101000101100001011111111101001100110111000110111100111111101001001100111101000101010011001111011100111111010111011010111100101100100011011011111100110101001001010101011100101101001011001110111101111101111011011000011101001100110111000110111100111111101001001100111101000100001111011001101011110 e7ebb5e591b7e6a4aae5a59be8b0bfe99b8de7f499e8a99ee7ebb5e591b7e6a4aae5a59defbdb0e99b8de7f499e887b35e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)