To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??UMnf??UMn^}Y??UMnf??UMn^}bE 0011111100111111010101010100110101101110011001100011111100111111010101010100110101101110010111100111110101011001001111110011111101010101010011010110111001100110001111110011111101010101010011010110111001011110011111010110001001000101 3f3f554d6e663f3f554d6e5e7d593f3f554d6e663f3f554d6e5e7d6245
SJIS-WIN 善?UMnf善?UMn^}Y善?UMnf善?UMn^}bE 100100010101000000111111010101010100110101101110011001101001000101010000001111110101010101001101011011100101111001111101010110011001000101010000001111110101010101001101011011100110011010010001010100000011111101010101010011010110111001011110011111010110001001000101 91503f554d6e6691503f554d6e5e7d5991503f554d6e6691503f554d6e5e7d6245
EUC-JP 善?UMnf善?UMn^}Y善?UMnf善?UMn^}bE 110000011011000100111111010101010100110101101110011001101100000110110001001111110101010101001101011011100101111001111101010110011100000110110001001111110101010101001101011011100110011011000001101100010011111101010101010011010110111001011110011111010110001001000101 c1b13f554d6e66c1b13f554d6e5e7d59c1b13f554d6e66c1b13f554d6e5e7d6245
UTF-8 善涉UMnf善涉UMn^}Y善涉UMnf善涉UMn^}bE 111001011001011010000100111001101011011010001001010101010100110101101110011001101110010110010110100001001110011010110110100010010101010101001101011011100101111001111101010110011110010110010110100001001110011010110110100010010101010101001101011011100110011011100101100101101000010011100110101101101000100101010101010011010110111001011110011111010110001001000101 e59684e6b689554d6e66e59684e6b689554d6e5e7d59e59684e6b689554d6e66e59684e6b689554d6e5e7d6245
UHC 善涉UMnf善涉UMn^}Y善涉UMnf善涉UMn^}bE 11100000101111001110000011101111010101010100110101101110011001101110000010111100111000001110111101010101010011010110111001011110011111010101100111100000101111001110000011101111010101010100110101101110011001101110000010111100111000001110111101010101010011010110111001011110011111010110001001000101 e0bce0ef554d6e66e0bce0ef554d6e5e7d59e0bce0ef554d6e66e0bce0ef554d6e5e7d6245

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)