To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 鼇??泣??幽? 1110101010000111001111110011111110001011100000110011111100111111100101110100100000111111 ea873f3f8b833f3f97483f
EUC-JP 鼇??泣??幽? 1111001111100111001111110011111110110101111000110011111100111111110011011010100100111111 f3e73f3fb5e33f3fcda93f
UTF-8 鼇앸뜉泣닷쫩幽욁 111010011011110010000111111011001001010110111000111010111001110010001001111001101011001110100011111010111000101110110111111011001010101110101001111001011011100110111101111011001001101010000001 e9bc87ec95b8eb9c89e6b3a3eb8bb7ecaba9e5b9bdec9a81
UHC 鼇앸뜉泣닷쫩幽욁 11101000101010001001110111101011100011011000110011101011111010001011010011100101101001101000001011101010111010111001111011100011 e8a89deb8d8cebe8b4e5a682eaeb9ee3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)