What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 瓦?胡競??縞? 100010101010001000111111100011001101001110001011101000110011111100111111100011101100100000111111 8aa23f8cd38ba33f3f8ec83f
EUC-JP 瓦?胡競??縞? 101101001010010000111111101110001101010110110110101001010011111100111111101111001100101000111111 b4a43fb8d5b6a53f3fbcca3f
UTF-8 瓦렧胡競렰렧縞동 111001111001001110100110111010111010000010100111111010001000001110100001111001111010101110110110111010111010000010110000111010111010000010100111111001111011100010011110111010111000111110011001 e793a6eba0a7e883a1e7abb6eba0b0eba0a7e7b89eeb8f99
UHC 瓦렧胡競렰렧縞동 11101000101111111000111010110110111110111101011111001100111001101000111010111101100011101011011011111011110101101011010110111111 e8bf8eb6fbd7cce68ebd8eb6fbd6b5bf

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
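
The conversion shown in the table can be approximated with a few lines of Python. This is only a minimal sketch, not the code behind this page: the function name encode_to_bits and the mapping of the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the 3f bytes in the ISO-8859-1, SJIS-WIN, and EUC-JP rows.

```python
def encode_to_bits(text, charset):
    """Return (binary, hexadecimal) strings for `text` encoded in `charset`.

    `encode_to_bits` is a hypothetical helper, not part of any library.
    """
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    raw = text.encode(charset, errors="replace")
    # Render each byte as 8 binary digits, concatenated with no separator.
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = encode_to_bits("瓦", codec)
        print(label, binary, hexadecimal)
```

Other converters may handle unrepresentable characters differently (for example by raising an error or emitting a different substitute), so results for such characters can vary between tools.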