To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鈺??節b?偃??節??堯??松ユ?倭?? 111110111100010000111111001111111001000011011111100000101000001000111111100110001110111000111111001111111001000011011111001111110011111111101010100111110011111100111111100011111011110010000011100001100011111110011000011000000011111100111111 fbc43f3f90df82823f98ee3f3f90df3f3fea9f3f3f8fbc83863f98603f3f
EUC-JP 鈺??節b?偃??節??堯??松ユ?倭?? 10001111111000111101010100111111001111111100000011100001101000111110001000111111110100001111000000111111001111111100000011100001001111110011111111110100101000010011111100111111101111101011111010100101111001100011111111001111110000010011111100111111 8fe3d53f3fc0e1a3e23fd0f03f3fc0e13f3ff4a13f3fbebea5e63fcfc13f3f
UTF-8 鈺싮뜈節b닽偃깁걠節븃춾堯뗰숲松ユ돮倭욆녉 111010011000100010111010111011001000101110101110111010111001110010001000111001111010111110000000111011111011110110000010111010111000101110111101111001011000000110000011111010101011100110000001111010101011000110100000111001111010111110000000111010111011100010000011111011001011011010111110111001011010000010101111111010111001011110110000111011001000100010110010111001101001110110111110111000111000001110100110111010111000111110101110111001011000000010101101111011001001101010000110111010111000010110001001 e988baec8baeeb9c88e7af80efbd82eb8bbde58183eab981eab1a0e7af80ebb883ecb6bee5a0afeb97b0ec88b2e69dbee383a6eb8faee580adec9a86eb8589
UHC 鈺싮뜈節b닽偃깁걠節븃춾堯뗰숲松ユ돮倭욆녉 111010001010110110011010111010011000110110001011111011111011110110100011111000101000100010101011111001011110011110110001111010011000000110001001111011111011110110111010111010001010110110011010111010001110101110001011111011111011110110100011111000011110011010101011111001101000100110110001111010001101111010011110111010001000011010111111 e8ad9ae98d8befbda3e288abe5e7b1e98189efbdbae8ad9ae8eb8befbda3e1e6abe689b1e8de9ee886bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)