To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 俑??萸??喩??艶N??????f??? 100110001101101000111111001111111110010011001110001111110011111110011010011001110011111100111111100010011001000010000010011011010011111100111111001111110011111100111111001111111000001010000110001111110011111100111111 98da3f3fe4ce3f3f9a673f3f8990826d3f3f3f3f3f3f82863f3f3f
EUC-JP 俑??萸??喩??艶N??倻???f??? 1101000011011100001111110011111111101000110100000011111100111111110100111100100000111111001111111011000111110000101000111100111000111111001111111000111110110001111101100011111100111111001111111010001111100110001111110011111100111111 d0dc3f3fe8d03f3fd3c83f3fb1f0a3ce3f3f8fb1f63f3f3fa3e63f3f3f
UTF-8 俑앹늿萸썹뙴喩묐짎艶N쎈떑倻귣떽璘f궇鱗꿁 111001001011111110010001111011001001010110111001111010111000101010111111111010001001000010111000111011001000110110111001111010111001100110110100111001011001011010101001111010111010110010010000111011001010011110001110111010001000100110110110111011111011110010101110111011001000111010001000111010111001011010010001111001011000000010111011111010101011011110100011111010111001011010111101111011111010011110101111111011111011110110000110111010101011011010000111111011111010011110110010111010101011111110000001 e4bf91ec95b9eb8abfe890b8ec8db9eb99b4e596a9ebac90eca78ee889b6efbcaeec8e88eb9691e580bbeab7a3eb96bdefa7afefbd86eab687efa7b2eabf81
UHC 俑앹늿萸썹뙴喩묐짎艶N쎈떑倻귣떽璘f궇鱗꿁 111010011011010110011101111011001000100010001000111010111010110110111101111001111000110010110111111010101110011110010001111010111010001110011010111001101111110110100011110011101011110111101011100010111010011111100101101001101000001011101011101101101011110111101100110111101010001111100110100000101010000011101100111001111000010101000010 e9b59dec8888ebadbde78cb7eae791eba39ae6fda3cebdeb8ba7e5a682ebb6bdecdea3e682a0ece78542

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A character that a charset cannot represent appears as "?" (hexadecimal 3f).
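
A conversion like the one in the table can be approximated with a short Python sketch. This is not the page's own implementation. The mapping from the table's charset labels to Python codec names (latin-1, cp932, euc_jp, utf-8, cp949) is an assumption, as is the helper name encode_table. Characters a charset cannot represent are replaced with "?", which matches the 3f bytes seen above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # Unified Hangul Code
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unmappable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("あ"):
        print(label, bits, hex_string)
```

Each byte is written as eight binary digits, so the binary column is always four times as long as the hexadecimal column.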