To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 張??撓??節?ぜ節??帳??將??鈺?? 1001001010100011001111110011111110011101100110100011111100111111100100001101111100111111100000101011101010010000110111110011111100111111100100101010000000111111001111111001101110010010001111110011111111111011110001000011111100111111 92a33f3f9d9a3f3f90df3f82ba90df3f3f92a03f3f9b923f3ffbc43f3f
EUC-JP 張??撓??節?ぜ節??帳??將??鈺?? 110001001010010100111111001111111101100111111010001111110011111111000000111000010011111110100100101111001100000011100001001111110011111111000100101000100011111100111111110101011111001000111111001111111000111111100011110101010011111100111111 c4a53f3fd9fa3f3fc0e13fa4bcc0e13f3fc4a23f3fd5f23f3f8fe3d53f3f
UTF-8 張ㅿ슭撓뷂슁節면ぜ節띄뼻帳뤄쉼將됵슬鈺섒눃 111001011011110010110101111000111000010110111111111011001000101010101101111001101001001010010011111010111011011110000010111011001000101010000001111001111010111110000000111010111010100110110100111000111000000110011100111001111010111110000000111010111001110110000100111010111011110010111011111001011011100010110011111010111010010010000100111011001000100110111100111001011011000010000111111010111001000010110101111011001000101010101100111010011000100010111010111011001000010010010010111010111000100010000011 e5bcb5e385bfec8aade69293ebb782ec8a81e7af80eba9b4e3819ce7af80eb9d84ebbcbbe5b8b3eba484ec89bce5b087eb90b5ec8aace988baec8492eb8883
UHC 張ㅿ슭撓뷂슁節면ぜ節띄뼻帳뤄쉼將됵슬鈺섒눃 111011011110010110100100111011111011110110111110111010001111010110010100111011111011110110110011111011111011110110111000111010011010101010111100111011111011110110110110111001111001011010111110111011011110001110110111111011111011110110110000111011011110001010001001111011111011110110111101111010001010110110011000111011101000011110100100 ede5a4efbdbee8f594efbdb3efbdb8e9aabcefbdb6e796beede3b7efbdb0ede289efbdbde8ad98ee87a4

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)