To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 全?郵?遠????遠?全?郵?遠????遠?^ 10010001010100110011111110010111010110000011111110001001100100110011111100111111001111110011111110001001100100110011111110010001010100110011111110010111010110000011111110001001100100110011111100111111001111110011111110001001100100110011111101011110 91533f97583f89933f3f3f3f89933f91533f97583f89933f3f3f3f89933f5e
EUC-JP 全?郵?遠????遠?全?郵?遠????遠?^ 11000001101101000011111111001101101110010011111110110001111100110011111100111111001111110011111110110001111100110011111111000001101101000011111111001101101110010011111110110001111100110011111100111111001111110011111110110001111100110011111101011110 c1b43fcdb93fb1f33f3f3f3fb1f33fc1b43fcdb93fb1f33f3f3f3fb1f33f5e
UTF-8 全렣郵렮遠펫쓺僚렮遠탉全렣郵렮遠펫쓺僚렮遠탉^ 11100101100001011010100011101011101000001010001111101001100000111011010111101011101000001010111011101001100000011010000011101101100011101010101111101100100100111011101011101111101001101011101111101011101000001010111011101001100000011010000011101101100000111000100111100101100001011010100011101011101000001010001111101001100000111011010111101011101000001010111011101001100000011010000011101101100011101010101111101100100100111011101011101111101001101011101111101011101000001010111011101001100000011010000011101101100000111000100101011110 e585a8eba0a3e983b5eba0aee981a0ed8eabec93baefa6bbeba0aee981a0ed8389e585a8eba0a3e983b5eba0aee981a0ed8eabec93baefa6bbeba0aee981a0ed83895e
UHC 全렣郵렮遠펫쓺僚렮遠탉全렣郵렮遠펫쓺僚렮遠탉^ 111011101110111110001110101101001110100111101000100011101011101111101010110000001100011011101010101111101011011011101000111010001000111010111011111010101100000011000101101111001110111011101111100011101011010011101001111010001000111010111011111010101100000011000110111010101011111010110110111010001110100010001110101110111110101011000000110001011011110001011110 eeef8eb4e9e88ebbeac0c6eabeb6e8e88ebbeac0c5bceeef8eb4e9e88ebbeac0c6eabeb6e8e88ebbeac0c5bc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)