To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 蹄???蹄???B 1001001011111011001111110011111100111111100100101111101100111111001111110011111101000010 92fb3f3f3f92fb3f3f3f42
EUC-JP 蹄???蹄???B 1100010011111101001111110011111100111111110001001111110100111111001111110011111101000010 c4fd3f3f3fc4fd3f3f3f42
UTF-8 蹄댓렰렣蹄댓렰렣B 11101000101110011000010011101011100011001001001111101011101000001011000011101011101000001010001111101000101110011000010011101011100011001001001111101011101000001011000011101011101000001010001101000010 e8b984eb8c93eba0b0eba0a3e8b984eb8c93eba0b0eba0a342
UHC 蹄댓렰렣蹄댓렰렣B 1111000010110100101101001111000110001110101111011000111010110100111100001011010010110100111100011000111010111101100011101011010001000010 f0b4b4f18ebd8eb4f0b4b4f18ebd8eb442

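SJIS-WIN, EUC-JP: Legacy charsets used mainly for Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent are replaced with "?" (0x3f), which is why the Korean characters in the sample appear as "?" in those rows.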
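
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The helper name `encode_table` and the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions. Python's mapping tables may differ from the converter's for some characters, so only the ISO-8859-1 and UTF-8 rows are certain to agree.

```python
# Assumed mapping from the table's charset labels to Python codec names.
# These are the closest standard-library codecs, not necessarily identical
# to the ones the web converter uses.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset.

    Characters the charset cannot represent are replaced with "?" (0x3f),
    mirroring the behavior visible in the table.
    """
    rows = []
    for label, codec in CODECS.items():
        data = text.encode(codec, errors="replace")
        # Render every byte as 8 binary digits, concatenated without spaces.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    # Same sample input as the table above.
    for label, bits, hex_string in encode_table("蹄댓렰렣蹄댓렰렣B"):
        print(label, bits, hex_string)
```

Using `errors="replace"` keeps the output the same length in characters as the input, which makes it easy to see which characters a given charset lacks.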