To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鶯??伊??應??艶j????邑??筌 1110100111110010001111110011111110001000110010010011111100111111100111001110010000111111001111111000100110010000100000101000101000111111001111110011111100111111100101110101011100111111001111111110001010100011 e9f23f3f88c93f3f9ce43f3f8990828a3f3f3f3f97573f3fe2a3
EUC-JP 鶯??伊??應??艶j?荑??邑??筌 11110010111101000011111100111111101100001100101100111111001111111101100011100110001111110011111110110001111100001010001111101010001111111000111111010111111110010011111100111111110011011011100000111111001111111110010010100101 f2f43f3fb0cb3f3fd8e63f3fb1f0a3ea3f8fd7f93f3fcdb83f3fe4a5
UTF-8 鶯ㅺ퉮伊쒏룚應밸룆艶j퍒荑녻돻邑뀀뼲筌 111010011011011010101111111000111000010110111010111011011000100110101110111001001011110010001010111011001001001010001111111010111010001110011010111001101000011110001001111010111011000010111000111010111010001110000110111010001000100110110110111011111011110110001010111011011000110110010010111010001000110110010001111010111000010110111011111010111000111110111011111010011000001010010001111010111000000010000000111010111011110010110010111001111010110110001100 e9b6afe385baed89aee4bc8aec928feba39ae68789ebb0b8eba386e889b6efbd8aed8d92e88d91eb85bbeb8fbbe98291eb8080ebbcb2e7ad8c
UHC 鶯ㅺ퉮伊쒏룚應밸룆艶j퍒荑녻돻邑뀀뼲筌 1110010110100011101001001110101010111001100001101110110010100101100111001110011010001111100101101110101111101011101110011110101110001111100001011110011011111101101000111110101010111011100010011110110010111111100001101110100010001001101111101110101111101001101100101110101110010110101101011110111110100111 e5a3a4eab986eca59ce68f96ebebb9eb8f85e6fda3eabb89ecbf86e889beebe9b2eb96b5efa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)