To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????h????????? 00111111001111110011111100111111001111110011111100111111001111110011111101101000001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f
SJIS-WIN 塋ゅ?弱?ぐ娃??h塋ゅ?弱?ぐ娃?? 1001101011001000100000101110001100111111100011101110001100111111100000101010111010001000101000010011111100111111011010001001101011001000100000101110001100111111100011101110001100111111100000101010111010001000101000010011111100111111 9ac882e33f8ee33f82ae88a13f3f689ac882e33f8ee33f82ae88a13f3f
EUC-JP 塋ゅ?弱?ぐ娃??h塋ゅ?弱?ぐ娃?? 1101010011001010101001001110010100111111101111001110010100111111101001001011000010110000101000110011111100111111011010001101010011001010101001001110010100111111101111001110010100111111101001001011000010110000101000110011111100111111 d4caa4e53fbce53fa4b0b0a33f3f68d4caa4e53fbce53fa4b0b0a33f3f
UTF-8 塋ゅ쩀弱딂ぐ娃쒎뎴h塋ゅ쩀弱딂ぐ娃쒎뎴 11100101101000011000101111100011100000101000010111101100101010011000000011100101101111001011000111101011100101001000001011100011100000011001000011100101101010001000001111101100100100101000111011101011100011101011010001101000111001011010000110001011111000111000001010000101111011001010100110000000111001011011110010110001111010111001010010000010111000111000000110010000111001011010100010000011111011001001001010001110111010111000111010110100 e5a18be38285eca980e5bcb1eb9482e38190e5a883ec928eeb8eb468e5a18be38285eca980e5bcb1eb9482e38190e5a883ec928eeb8eb4
UHC 塋ゅ쩀弱딂ぐ娃쒎뎴h塋ゅ쩀弱딂ぐ娃쒎뎴 11100111101010111010101011100101101001001001101011100101101100001000101011101000101010101011000011101000110111111001110011100101100010011000011101101000111001111010101110101010111001011010010010011010111001011011000010001010111010001010101010110000111010001101111110011100111001011000100110000111 e7abaae5a49ae5b08ae8aab0e8df9ce5898768e7abaae5a49ae5b08ae8aab0e8df9ce58987

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
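
The conversion shown in the table can be approximated with a short Python sketch. This is not the tool's own implementation. The mapping from the table's charset labels to Python codec names (`CODECS`) and the helper names `encode_row` and `encode_table` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 3f), which appears to match the 3f bytes in the ISO-8859-1, SJIS-WIN and EUC-JP rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hexadecimal string) for text in one codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("ぐh").items():
        print(label, bits, hex_string)
```

For the single character ぐ, this sketch yields the same byte values that appear for ぐ in the table rows: e38190 in UTF-8, 82ae in SJIS-WIN, a4b0 in EUC-JP and aab0 in UHC.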