To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ?ъ??ъ?恁れ 001111111000010010001100001111110011111110000100100011000011111110011100100011001000001011101010 3f848c3f3f848c3f9c8c82ea
EUC-JP ?ъ??ъ?恁れ 001111111010011111101100001111110011111110100111111011000011111111010111111011001010010011101100 3fa7ec3f3fa7ec3fd7eca4ec
UTF-8 泥ъ껌吏ъ콐恁れ 11101111101001111010001111010001100010101110101010111011100011001110111110100111100111101101000110001010111011001011110110010000111001101000000110000001111000111000001010001100 efa7a3d18aeabb8cefa79ed18aecbd90e68181e3828c
UHC 泥ъ껌吏ъ콐恁れ 11101100101100101010110011101100101100101010110111101100101001111010110011101100101100011000110011101100111101101010101011101100 ecb2acecb2adeca7acecb18cecf6aaec

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)