What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 燿??汚??燿??玉??燿??汚??燿??玉??^ 111000001010000000111111001111111000100110011000001111110011111111100000101000000011111100111111100010111100101000111111001111111110000010100000001111110011111110001001100110000011111100111111111000001010000000111111001111111000101111001010001111110011111101011110 e0a03f3f89983f3fe0a03f3f8bca3f3fe0a03f3f89983f3fe0a03f3f8bca3f3f5e
EUC-JP 燿??汚??燿??玉??燿??汚??燿??玉??^ 111000001010001000111111001111111011000111111000001111110011111111100000101000100011111100111111101101101100110000111111001111111110000010100010001111110011111110110001111110000011111100111111111000001010001000111111001111111011011011001100001111110011111101011110 e0a23f3fb1f83f3fe0a23f3fb6cc3f3fe0a23f3fb1f83f3fe0a23f3fb6cc3f3f5e
UTF-8 燿쒙쉔汚뤄슨燿쒙쉔玉졾눀燿쒙쉔汚뤄슨燿쒙쉔玉졿쑍^ 11100111100001111011111111101100100100101001100111101100100010011001010011100110101100011001101011101011101001001000010011101100100010101010100011100111100001111011111111101100100100101001100111101100100010011001010011100111100011101000100111101100101000011011111011101011100010001000000011100111100001111011111111101100100100101001100111101100100010011001010011100110101100011001101011101011101001001000010011101100100010101010100011100111100001111011111111101100100100101001100111101100100010011001010011100111100011101000100111101100101000011011111111101100100100011000110101011110 e787bfec9299ec8994e6b19aeba484ec8aa8e787bfec9299ec8994e78e89eca1beeb8880e787bfec9299ec8994e6b19aeba484ec8aa8e787bfec9299ec8994e78e89eca1bfec918d5e
UHC 燿쒙쉔汚뤄슨燿쒙쉔玉졾눀燿쒙쉔汚뤄슨燿쒙쉔玉졿쑍^ 11101000111111001001110011101111101111011010100011100111111111011011011111101111101111011011110011101000111111001001110011101111101111011010100011101000101011001010000011100101100001111010000111101000111111001001110011101111101111011010100011100111111111011011011111101111101111011011110011101000111111001001110011101111101111011010100011101000101011001010000011100110100111001010110001011110 e8fc9cefbda8e7fdb7efbdbce8fc9cefbda8e8aca0e587a1e8fc9cefbda8e7fdb7efbdbce8fc9cefbda8e8aca0e69cac5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
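
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. It assumes that the table's charset labels map onto Python's standard codecs as listed in CHARSETS (for example, SJIS-WIN to cp932 and UHC to cp949), and that characters a charset cannot represent are replaced with "?" (hex 3f), which is what the ISO-8859-1 row suggests. The names CHARSETS, encode_row, and encode_table are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Return {charset label: (bits, hex)} for every charset in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexa) in encode_table("玉^").items():
        print(label, bits, hexa)
```

For the character 玉, this sketch yields 8bca under cp932, b6cc under euc_jp, and e78e89 under utf-8, which agrees with the byte sequences that appear for that character in the table rows.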