What bit string is a character (or a short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????\ 0011111100111111001111110011111100111111001111110011111100111111001111110011111101011100 3f3f3f3f3f3f3f3f3f3f5c
SJIS-WIN ??移?鷹??怨峰舞\ 00111111001111111000100011011010001111111001000111101001001111110011111110001001100001011001010111110100100101011001000101011100 3f3f88da3f91e93f3f898595f495915c
EUC-JP ??移?鷹??怨峰舞\ 00111111001111111011000011011100001111111100001011101011001111110011111110110001111001011100101011110110110010011111000101011100 3f3fb0dc3fc2eb3f3fb1e5caf6c9f15c
UTF-8 欌렪移렊鷹꿴떵怨峰舞\ 11100110101011001000110011101011101000001010101011100111101001111011101111101011101000001000101011101001101101111011100111101010101111111011010011101011100101101011010111100110100000001010100011100101101100111011000011101000100010001001111001011100 e6ac8ceba0aae7a7bbeba08ae9b7b9eabfb4eb96b5e680a8e5b3b0e8889e5c
UHC 欌렪移렊鷹꿴떵怨峰舞\ 111011011110101110001110101110001110110010111001100011101010000111101011111011011011001011101001101101101011101011101010101100111101110011101000110110011111000101011100 edeb8eb8ecb98ea1ebedb2e9b6baeab3dce8d9f15c

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
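
A table like the one above can be approximated offline with a short script. The following is a minimal sketch, not the converter's actual implementation. It assumes that Python's `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs correspond to the ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC rows, and that a character the charset cannot represent is replaced with `?` (0x3f), as the `3f` bytes in the table suggest. The names `CHARSETS` and `encode_table` are made up for this example.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary bit string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hexadecimal in encode_table("移"):
        print(label, binary, hexadecimal)
```

For the single character 移, this sketch yields `e7a7bb` for UTF-8, `88da` for SJIS-WIN, and `b0dc` for EUC-JP, which matches the bytes shown for that character in the table.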