To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 褥??歪②?炎??節??絶??裔??蜈?∮ 111001011111000100111111001111111001100001100011100001110100000100111111100010011000101000111111001111111001000011011111001111110011111110010000111000100011111100111111111001011110000100111111001111111110010110000101001111111000011110010011 e5f13f3f986387413f898a3f3f90df3f3f90e23f3fe5e13f3fe5853f8793
EUC-JP 褥??歪??炎??節??絶??裔??蜈?? 11101010111100110011111100111111110011111100010000111111001111111011000111101010001111110011111111000000111000010011111100111111110000001110010000111111001111111110101011100011001111110011111111101001111001010011111100111111 eaf33f3fcfc43f3fb1ea3f3fc0e13f3fc0e43f3feae33f3fe9e53f3f
UTF-8 褥곤푴歪②굜炎ㅹ쑚節㏝낡絶귡궓裔꾤쑄蜈띺∮ 111010001010010010100101111010101011001110100100111011011001000110110100111001101010110110101010111000101001000110100001111010101011010110011100111001111000001010001110111000111000010110111001111011001001000110011010111001111010111110000000111000111000111110011101111010111000001010100001111001111011010110110110111010101011011110100001111010101011011010010011111010001010001110010100111010101011111010100100111011001001000110000100111010001001110010001000111010111001110110111010111000101000100010101110 e8a4a5eab3a4ed91b4e6adaae291a1eab59ce7828ee385b9ec919ae7af80e38f9deb82a1e7b5b6eab7a1eab693e8a394eabea4ec9184e89c88eb9dbae288ae
UHC 褥곤푴歪②굜炎ㅹ쑚節㏝낡絶귡궓裔꾤쑄蜈띺∮ 111010011011001110110000111011111011111010000010111010001110000010101000111010001000001010000100111001101111101010100100111010011001110010111001111011111011110110100111111010011011001110110000111011111011111010000010111010011000001010101000111001111110000010000100111001111001110010100100111010001010010110001101111010011010001010110001 e9b3b0efbe82e8e0a8e88284e6faa4e99cb9efbda7e9b3b0efbe82e982a8e7e084e79ca4e8a58de9a2b1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)