Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 鵝??裔????????鵝??裔????????^ 1110101001000000001111110011111111100101111000010011111100111111001111110011111100111111001111110011111100111111111010100100000000111111001111111110010111100001001111110011111100111111001111110011111100111111001111110011111101011110 ea403f3fe5e13f3f3f3f3f3f3f3fea403f3fe5e13f3f3f3f3f3f3f3f5e
EUC-JP 鵝??裔????????鵝??裔????????^ 1111001110100001001111110011111111101010111000110011111100111111001111110011111100111111001111110011111100111111111100111010000100111111001111111110101011100011001111110011111100111111001111110011111100111111001111110011111101011110 f3a13f3feae33f3f3f3f3f3f3f3ff3a13f3feae33f3f3f3f3f3f3f3f5e
UTF-8 鵝븍젲裔꾣닡溜곕젨樂꾨뮮鵝븍젲裔꾣닡溜곕젨樂꾨뮠^ 11101001101101011001110111101011101110001000110111101100101000001011001011101000101000111001010011101010101111101010001111101011100010111010000111101111101001111000101111101010101100111001010111101100101000001010100011101111101001101011111111101010101111101010100011101011101011101010111011101001101101011001110111101011101110001000110111101100101000001011001011101000101000111001010011101010101111101010001111101011100010111010000111101111101001111000101111101010101100111001010111101100101000001010100011101111101001101011111111101010101111101010100011101011101011101010000001011110 e9b59debb88deca0b2e8a394eabea3eb8ba1efa78beab395eca0a8efa6bfeabea8ebaeaee9b59debb88deca0b2e8a394eabea3eb8ba1efa78beab395eca0a8efa6bfeabea8ebaea05e
UHC 鵝븍젲裔꾣닡溜곕젨樂꾨뮮鵝븍젲裔꾣닡溜곕젨樂꾨뮠^ 11100100101111011011101011101011101000001010011011100111111000001000010011100110100010001010000111101010111111101011000011101011101000001010000011101000111110011000010011101011100100101011011111100100101111011011101011101011101000001010011011100111111000001000010011100110100010001010000111101010111111101011000011101011101000001010000011101000111110011000010011101011100100101010110001011110 e4bdbaeba0a6e7e084e688a1eafeb0eba0a0e8f984eb92b7e4bdbaeba0a6e7e084e688a1eafeb0eba0a0e8f984eb92ac5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
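
The conversion in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The helper name `encode_rows` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the table does, although the page's actual behavior is not confirmed here.

```python
# Hypothetical mapping: table labels -> Python codec names (an assumption).
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes '?' for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_rows("あ^"):
        print(label, bits, hex_string)
```

Each byte is written as eight binary digits, so the binary column is always four times as long as the hexadecimal column.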