To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????h 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN ???????????????而???????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110001110101001110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f8ea73f3f3f3f3f3f3f68
EUC-JP ???????????????而???????h 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111110111100101010010011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3fbca93f3f3f3f3f3f3f68
UTF-8 溜삳젘溜뷜뵗溜롫졎溜믩졋溜⑸졋而듬젒溜븍뿊溜괆h 11101111101001111000101111101100100000101011001111101100101000001001100011101111101001111000101111101011101101111001110011101011101101011001011111101111101001111000101111101011101000011010101111101100101000011000111011101111101001111000101111101011101011111010100111101100101000011000101111101111101001111000101111100010100100011011100011101100101000011000101111101000100000001000110011101011100100111010110011101100101000001001001011101111101001111000101111101011101110001000110111101011101111111000101011101111101001111000101111101010101101001000011001101000 efa78bec82b3eca098efa78bebb79cebb597efa78beba1abeca18eefa78bebafa9eca18befa78be291b8eca18be8808ceb93aceca092efa78bebb88debbf8aefa78beab48668
UHC 溜삳젘溜뷜뵗溜롫졎溜믩졋溜⑸졋而듬젒溜븍뿊溜괆h 1110101011111110101110111110101110100000100101001110101011111110101110101110001010010100100110011110101011111110100011101110101110100000101110111110101011111110100100101110101110100000101110101110101011111110101010011110101110100000101110101110110010111011101101011110101110100000100100011110101011111110101110101110101110010111100100011110101011111110101100001111111001101000 eafebbeba094eafebae29499eafe8eeba0bbeafe92eba0baeafea9eba0baecbbb5eba091eafebaeb9791eafeb0fe68

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)