To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 阡慕距ホ・郤娯斡 1110100010010100100101011110011110001011100101111100111010100101111001111011101010001100111000101000100010110100 e89495e78b97cea5e7ba8ce288b4
EUC-JP 阡慕距ホ・郤娯斡 11101111111101001100101011101001101101011111011110001110110011101000111010100101111011101011110010111000111001001011000010110110 eff4cae9b5f78ece8ea5eebcb8e4b0b6
UTF-8 阡慕距ホ・郤娯斡 111010011001100010100001111001101000010110010101111010001011011110011101111011111011111010001110111011111011110110100101111010011000001110100100111001011010100010101111111001101001011010100001 e998a1e68595e8b79defbe8eefbda5e983a4e5a8afe696a1
UHC 阡慕距????斡 111101001100011011011001101101111100101111100101001111110011111100111111001111111110010011010110 f4c6d9b7cbe53f3f3f3fe4d6

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)