To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????^ 00111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ??????宋??^ 0011111100111111001111110011111100111111001111111001000101110110001111110011111101011110 3f3f3f3f3f3f91763f3f5e
EUC-JP ???靷??宋??^ 00111111001111110011111110001111111001111011110100111111001111111100000111010111001111110011111101011110 3f3f3f8fe7bd3f3fc1d73f3f5e
UTF-8 嶺뚢넄靷믥춯宋뷀늿^ 11101111101001101010101111101011100110101010001011101011100001001000010011101001100111011011011111101011101011111010010111101100101101101010111111100101101011101000101111101011101101111000000011101011100010101011111101011110 efa6abeb9aa2eb8484e99db7ebafa5ecb6afe5ae8bebb780eb8abf5e
UHC 嶺뚢넄靷믥춯宋뷀늿^ 11100111101011011000110011100010100001101001010011101100111001101001001011100111101011011000110011100001111001001001010011101101100010001000100001011110 e7ad8ce28694ece692e7ad8ce1e494ed88885e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)