To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猥??音?????倚??酉??筌??異?? 111000001100111000111111001111111000100110111001001111110011111100111111001111110011111110011000110111110011111100111111100100111101000100111111001111111110001010100011001111110011111110001000110110010011111100111111 e0ce3f3f89b93f3f3f3f3f98df3f3f93d13f3fe2a33f3f88d93f3f
EUC-JP 猥??音?????倚??酉??筌??異?? 111000001101000000111111001111111011001010111011001111110011111100111111001111110011111111010000111000010011111100111111110001101101001100111111001111111110010010100101001111110011111110110000110110110011111100111111 e0d03f3fb2bb3f3f3f3f3fd0e13f3fc6d33f3fe4a53f3fb0db3f3f
UTF-8 猥롢뀧音곗뱻濾곌쑬倚싨쐣酉귥춸筌뤾쑴異븀솾 111001111000110010100101111010111010000110100010111010111000000010100111111010011001111110110011111010101011001110010111111010111011000110111011111011111010011010000100111010101011001110001100111011001001000110101100111001011000000010011010111011001000101110101000111011001001000010100011111010011000010110001001111010101011011110100101111011001011011010111000111001111010110110001100111010111010010010111110111011001001000110110100111001111001010110110000111010111011100010000000111011001000011010111110 e78ca5eba1a2eb80a7e99fb3eab397ebb1bbefa684eab38cec91ace5809aec8ba8ec90a3e98589eab7a5ecb6b8e7ad8ceba4beec91b4e795b0ebb880ec86be
UHC 猥롢뀧音곗뱻濾곌쑬倚싨쐣酉귥춸筌뤾쑴異븀솾 111010001110010110001110111000111000010110011110111010111110010110110000111011001001001110100001111001101010010010110000111010101011111010101000111010111110111110011010111001101001110010001001111010111011011110000010111011001010110110010100111011111010011110001111111010101011111010101001111011001011011010111010111001111001100110110010 e8e58ee3859eebe5b0ec93a1e6a4b0eabea8ebef9ae69c89ebb782ecad94efa78feabea9ecb6bae799b2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)