To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 梧??節??張??猥??鶯??燿③?癌?? 1000110011100110001111110011111110010000110111110011111100111111100100101010001100111111001111111110000011001110001111110011111111101001111100100011111100111111111000001010000010000111010000100011111110001010111000000011111100111111 8ce63f3f90df3f3f92a33f3fe0ce3f3fe9f23f3fe0a087423f8ae03f3f
EUC-JP 梧??節??張??猥??鶯??燿??癌?? 10111000111010000011111100111111110000001110000100111111001111111100010010100101001111110011111111100000110100000011111100111111111100101111010000111111001111111110000010100010001111110011111110110100111000100011111100111111 b8e83f3fc0e13f3fc4a53f3fe0d03f3ff2f43f3fe0a23f3fb4e23f3f
UTF-8 梧잍뇣節삣뜳張믦쵟猥롳숱鶯긷뼤燿③캒癌댈쓻 111001101010001010100111111011001001111010001101111010111000011110100011111001111010111110000000111011001000001010100011111010111001110010110011111001011011110010110101111010111010111110100110111011001011010110011111111001111000110010100101111010111010000110110011111011001000100010110001111010011011011010101111111010101011100010110111111010111011110010100100111001111000011110111111111000101001000110100010111011001011101010010010111001111001100110001100111010111000110010001000111011001001001110111011 e6a2a7ec9e8deb87a3e7af80ec82a3eb9cb3e5bcb5ebafa6ecb59fe78ca5eba1b3ec88b1e9b6afeab8b7ebbca4e787bfe291a2ecba92e7998ceb8c88ec93bb
UHC 梧잍뇣節삣뜳張믦쵟猥롳숱鶯긷뼤燿③캒癌댈쓻 111001111111110010011111111001101000011110001011111011111011110110111011111001011000110110110001111011011110010110010010111010001010110010100000111010001110010110001110111011111011110110100010111001011010001110110001111001011001011010100111111010001111110010101000111010011010111110011011111001001101111110110100111011101001110110010110 e7fc9fe6878befbdbbe58db1ede592e8aca0e8e58eefbda2e5a3b1e596a7e8fca8e9af9be4dfb4ee9d96

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)