To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?脹???珪?泌呈??瀨?翹?紐珪?韓 00111111100100101010111100111111001111110011111110001100010111010011111110010100111001011001001011100110001111110011111111111011010100000011111111100011110010010011111110010101010100101000110001011101001111111000101011011000 3f92af3f3f3f8c5d3f94e592e63f3ffb503fe3c93f95528c5d3f8ad8
EUC-JP ?脹???珪?泌呈????翹?紐珪?韓 001111111100010010110001001111110011111100111111101101111011111000111111110010001110011111000100111010000011111100111111001111110011111111100110110010110011111111001001101100111011011110111110001111111011010011011010 3fc4b13f3f3fb7be3fc8e7c4e83f3f3f3fe6cb3fc9b3b7be3fb4da
UTF-8 뤋脹쫸샘폄珪뤋泌呈쮲샘瀨렠翹렡紐珪뤋韓 111010111010010010001011111010001000010010111001111011001010101110111000111011001000001110011000111011011000111110000100111001111000111110101010111010111010010010001011111001101011001110001100111001011001000110001000111011001010111010110010111011001000001110011000111001111000000010101000111010111010000010100000111001111011111110111001111010111010000010100001111001111011010010010000111001111000111110101010111010111010010010001011111010011001111110010011 eba48be884b9ecabb8ec8398ed8f84e78faaeba48be6b38ce59188ecaeb2ec8398e780a8eba0a0e7bfb9eba0a1e7b490e78faaeba48be99f93
UHC 뤋脹쫸샘폄珪뤋泌呈쮲샘瀨렠翹렡紐珪뤋韓 1000111110111011111100111110110010100110100011111011101111111001110001101110111111010000101010001000111110111011111110011011001011101111110100001010100010001111101110111111100111010110111011101000111010110001110011101110111010001110101100101101001011101111110100001010100010001111101110111111100111011011 8fbbf3eca68fbbf9c6efd0a88fbbf9b2efd0a88fbbf9d6ee8eb1ceee8eb2d2efd0a88fbbf9db

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)