To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 程?????程?郵?梓?程?????程?郵?梓?B 100100101111011000111111001111110011111100111111001111111001001011110110001111111001011101011000001111111000100010110010001111111001001011110110001111110011111100111111001111110011111110010010111101100011111110010111010110000011111110001000101100100011111101000010 92f63f3f3f3f3f92f63f97583f88b23f92f63f3f3f3f3f92f63f97583f88b23f42
EUC-JP 程?釪???程?郵?梓?程?釪???程?郵?梓?B 11000100111110000011111110001111111000111010110100111111001111110011111111000100111110000011111111001101101110010011111110110000101101000011111111000100111110000011111110001111111000111010110100111111001111110011111111000100111110000011111111001101101110010011111110110000101101000011111101000010 c4f83f8fe3ad3f3f3fc4f83fcdb93fb0b43fc4f83f8fe3ad3f3f3fc4f83fcdb93fb0b43f42
UTF-8 程렣釪닺렣렡程렣郵렮梓됨程렣釪닺렣렡程렣郵렮梓됨B 11100111101010001000101111101011101000001010001111101001100001111010101011101011100010111011101011101011101000001010001111101011101000001010000111100111101010001000101111101011101000001010001111101001100000111011010111101011101000001010111011100110101000101001001111101011100100001010100011100111101010001000101111101011101000001010001111101001100001111010101011101011100010111011101011101011101000001010001111101011101000001010000111100111101010001000101111101011101000001010001111101001100000111011010111101011101000001010111011100110101000101001001111101011100100001010100001000010 e7a88beba0a3e987aaeb8bbaeba0a3eba0a1e7a88beba0a3e983b5eba0aee6a293eb90a8e7a88beba0a3e987aaeb8bbaeba0a3eba0a1e7a88beba0a3e983b5eba0aee6a293eb90a842
UHC 程렣釪닺렣렡程렣郵렮梓됨程렣釪닺렣렡程렣郵렮梓됨B 11101111111011111000111010110100111010011110100110110100111010001000111010110100100011101011001011101111111011111000111010110100111010011110100010001110101110111110111010101001101101011100101011101111111011111000111010110100111010011110100110110100111010001000111010110100100011101011001011101111111011111000111010110100111010011110100010001110101110111110111010101001101101011100101001000010 efef8eb4e9e9b4e88eb48eb2efef8eb4e9e88ebbeea9b5caefef8eb4e9e9b4e88eb48eb2efef8eb4e9e88ebbeea9b5ca42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)