To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嶸??揖??蹂??嶸??揖??蟻??驛??揖 111110101011010000111111001111111001011101001011001111110011111111100110111110000011111100111111111110101011010000111111001111111001011101001011001111110011111110001011011000010011111100111111111010011000001100111111001111111001011101001011 fab43f3f974b3f3fe6f83f3ffab43f3f974b3f3f8b613f3fe9833f3f974b
EUC-JP 嶸??揖??蹂??嶸??揖?ˇ蟻??驛??揖 10001111101110111111010000111111001111111100110110101100001111110011111111101100111110100011111100111111100011111011101111110100001111110011111111001101101011000011111110001111101000101011000010110101110000100011111100111111111100011110001100111111001111111100110110101100 8fbbf43f3fcdac3f3fecfa3f3f8fbbf43f3fcdac3f8fa2b0b5c23f3ff1e33f3fcdac
UTF-8 嶸뗭옚揖졿솮蹂⑹뒛嶸뗭옚揖욘ˇ蟻욎돺驛노돍揖 1110010110110110101110001110101110010111101011011110110010011000100110101110011010001111100101101110110010100001101111111110110010000110101011101110100010111001100000101110001010010001101110011110101110010010100110111110010110110110101110001110101110010111101011011110110010011000100110101110011010001111100101101110110010011010100110001100101110000111111010001001111110111011111011001001101010001110111010111000111110111010111010011010100110011011111010111000010110111000111010111000111110001101111001101000111110010110 e5b6b8eb97adec989ae68f96eca1bfec86aee8b982e291b9eb929be5b6b8eb97adec989ae68f96ec9a98cb87e89fbbec9a8eeb8fbae9a99beb85b8eb8f8de68f96
UHC 嶸뗭옚揖졿솮蹂⑹뒛嶸뗭옚揖욘ˇ蟻욎돺驛노돍揖 1110011110101110100010111110110010011110100111101110101111100111101000001110011010011001101001001110101110110011101010011110110010001010100110001110011110101110100010111110110010011110100111101110101111100111101111111110011010100010101001111110101111111100100111101110110010001001101111011110011010111110101100111110101110001001100110111110101111100111 e7ae8bec9e9eebe7a0e699a4ebb3a9ec8a98e7ae8bec9e9eebe7bfe6a2a7ebfc9eec89bde6beb3eb899bebe7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)