To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?脹???珪?瑕漿?幀惚???夏呈?? 001111111001001010101111001111110011111100111111100011000101110100111111111000001110101010011111111101110011111110011011111010101000110110011011001111110011111100111111100010011100010010010010111001100011111100111111 3f92af3f3f3f8c5d3fe0ea9ff73f9bea8d9b3f3f3f89c492e63f3f
EUC-JP ?脹???珪?瑕漿?幀惚???夏呈?? 001111111100010010110001001111110011111100111111101101111011111000111111111000001110110011011110111110010011111111010110111011001011100111111011001111110011111100111111101100101100011011000100111010000011111100111111 3fc4b13f3f3fb7be3fe0ecdef93fd6ecb9fb3f3f3fb2c6c4e83f3f
UTF-8 뤋脹쫸샘억珪뤋瑕漿궐幀惚샅렑뤋夏呈쮲샘 111010111010010010001011111010001000010010111001111011001010101110111000111011001000001110011000111011001001011010110101111001111000111110101010111010111010010010001011111001111001000110010101111001101011110010111111111010101011011010010000111001011011100110000000111001101000001110011010111011001000001110000101111010111010000010010001111010111010010010001011111001011010010010001111111001011001000110001000111011001010111010110010111011001000001110011000 eba48be884b9ecabb8ec8398ec96b5e78faaeba48be79195e6bcbfeab690e5b980e6839aec8385eba091eba48be5a48fe59188ecaeb2ec8398
UHC 뤋脹쫸샘억珪뤋瑕漿궐幀惚샅렑뤋夏呈쮲샘 1000111110111011111100111110110010100110100011111011101111111001101111101110111111010000101010001000111110111011111110011100001011101101111011001011000111001000111011111101001111111011111011011011101111110100100011101010011010001111101110111111100110111110111011111101000010101000100011111011101111111001 8fbbf3eca68fbbf9beefd0a88fbbf9c2edecb1c8efd3fbedbbf48ea68fbbf9beefd0a88fbbf9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)