To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鼇??鎰?ゥ怨レ?鼇??鎰?ゥ怨レ?B 1110101010000111001111110011111111101000010011000011111110000011010001001000100110000101100000111000110000111111111010101000011100111111001111111110100001001100001111111000001101000100100010011000010110000011100011000011111101000010 ea873f3fe84c3f83448985838c3fea873f3fe84c3f83448985838c3f42
EUC-JP 鼇??鎰?ゥ怨レ?鼇??鎰?ゥ怨レ?B 1111001111100111001111110011111111101111101011010011111110100101101001011011000111100101101001011110110000111111111100111110011100111111001111111110111110101101001111111010010110100101101100011110010110100101111011000011111101000010 f3e73f3fefad3fa5a5b1e5a5ec3ff3e73f3fefad3fa5a5b1e5a5ec3f42
UTF-8 鼇앸뵃鎰쒒ゥ怨レ벛鼇앸뵃鎰쒒ゥ怨レ벛B 11101001101111001000011111101100100101011011100011101011101101011000001111101001100011101011000011101100100100101001001011100011100000101010010111100110100000001010100011100011100000111010110011101011101100101001101111101001101111001000011111101100100101011011100011101011101101011000001111101001100011101011000011101100100100101001001011100011100000101010010111100110100000001010100011100011100000111010110011101011101100101001101101000010 e9bc87ec95b8ebb583e98eb0ec9292e382a5e680a8e383acebb29be9bc87ec95b8ebb583e98eb0ec9292e382a5e680a8e383acebb29b42
UHC 鼇앸뵃鎰쒒ゥ怨レ벛鼇앸뵃鎰쒒ゥ怨レ벛B 11101000101010001001110111101011100101001000100111101100111100001001110011101001101010111010010111101010101100111010101111101100100100111011011011101000101010001001110111101011100101001000100111101100111100001001110011101001101010111010010111101010101100111010101111101100100100111011011001000010 e8a89deb9489ecf09ce9aba5eab3abec93b6e8a89deb9489ecf09ce9aba5eab3abec93b642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)