To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??????攸?????宥??酉??永??裕?? 0011111100111111001111110011111100111111001111111001110110111111001111110011111100111111001111110011111110010111010001110011111100111111100100111101000100111111001111111000100101101001001111110011111110010111010101000011111100111111 3f3f3f3f3f3f9dbf3f3f3f3f3f97473f3f93d13f3f89693f3f97543f3f
EUC-JP ??????攸?????宥??酉??永??裕?? 0011111100111111001111110011111100111111001111111101101011000001001111110011111100111111001111110011111111001101101010000011111100111111110001101101001100111111001111111011000111001010001111110011111111001101101101010011111100111111 3f3f3f3f3f3fdac13f3f3f3f3fcda83f3fc6d33f3fb1ca3f3fcdb53f3f
UTF-8 咽됱빆杻㎬쯁攸됲뮏輦깆룜宥껅샍酉고닍永띠옃裕드컜 111011111010011010011110111010111001000010110001111010111011100110000110111011111010011110001000111000111000111010101100111011001010111110000001111001101001010010111000111010111001000010110010111010111010111010001111111011111010011010011000111010101011100110000110111010111010001110011100111001011010111010100101111010101011101110000101111011001000001110001101111010011000010110001001111010101011001110100000111010111000101110001101111001101011000010111000111010111001110110100000111011001001100010000011111010001010001110010101111010111001001110011100111011001011101110011100 efa69eeb90b1ebb986efa788e38eacecaf81e694b8eb90b2ebae8fefa698eab986eba39ce5aea5eabb85ec838de98589eab3a0eb8b8de6b0b8eb9da0ec9883e8a395eb939cecbb9c
UHC 咽됱빆杻㎬쯁攸됲뮏輦깆룜宥껅샍酉고닍永띠옃裕드컜 111001101110110010001001111011001001010110101101111010101111010010100111111010001010100010011101111010101111001010001001111011011001001010011100111001101110010010110001111011001000111110011000111010101110100110000011111001101001100010111011111010111011011110110000111011011000100010010011111001111011010110110110111011001001111010001111111010111010111010110101111001011011000010000111 e6ec89ec95adeaf4a7e8a89deaf289ed929ce6e4b1ec8f98eae983e698bbebb7b0ed8893e7b5b6ec9e8febaeb5e5b087

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)