To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??????鶯??????k????稔??B 00111111001111110011111100111111001111110011111111101001111100100011111100111111001111110011111100111111001111111000001010001011001111110011111100111111001111111001011010101011001111110011111101000010 3f3f3f3f3f3fe9f23f3f3f3f3f3f828b3f3f3f3f96ab3f3f42
EUC-JP ??????鶯??????k????稔??B 00111111001111110011111100111111001111110011111111110010111101000011111100111111001111110011111100111111001111111010001111101011001111110011111100111111001111111100110010101101001111110011111101000010 3f3f3f3f3f3ff2f43f3f3f3f3f3fa3eb3f3f3f3fccad3f3f42
UTF-8 溜삳젔溜븍눋鶯숇젔溜븍㎗溜k졎溜믩졋稔꾠럸B 11101111101001111000101111101100100000101011001111101100101000001001010011101111101001111000101111101011101110001000110111101011100010001000101111101001101101101010111111101100100010001000011111101100101000001001010011101111101001111000101111101011101110001000110111100011100011101001011111101111101001111000101111101111101111011000101111101100101000011000111011101111101001111000101111101011101011111010100111101100101000011000101111100111101010001001010011101010101111101010000011101011100111111011100001000010 efa78bec82b3eca094efa78bebb88deb888be9b6afec8887eca094efa78bebb88de38e97efa78befbd8beca18eefa78bebafa9eca18be7a894eabea0eb9fb842
UHC 溜삳젔溜븍눋鶯숇젔溜븍㎗溜k졎溜믩졋稔꾠럸B 11101010111111101011101111101011101000001001001011101010111111101011101011101011101101001010110011100101101000111001100111101011101000001001001011101010111111101011101011101011101001111010001111101010111111101010001111101011101000001011101111101010111111101001001011101011101000001011101011101100111110011000010011100011100011101001011101000010 eafebbeba092eafebaebb4ace5a399eba092eafebaeba7a3eafea3eba0bbeafe92eba0baecf984e38e9742

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)