To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????????h 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101101000 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f68
SJIS-WIN 怏??乙?怏??乙ゅ??γ?怏??乙ゅ??γ?h 10011100100010010011111100111111100010011011001100111111100111001000100100111111001111111000100110110011100000101110001100111111001111111000001111000001001111111001110010001001001111110011111110001001101100111000001011100011001111110011111110000011110000010011111101101000 9c893f3f89b33f9c893f3f89b382e33f3f83c13f9c893f3f89b382e33f3f83c13f68
EUC-JP 怏??乙?怏??乙ゅ?洹γ?怏??乙ゅ?洹γ?h 1101011111101001001111110011111110110010101101010011111111010111111010010011111100111111101100101011010110100100111001010011111110001111110001111011101010100110110000110011111111010111111010010011111100111111101100101011010110100100111001010011111110001111110001111011101010100110110000110011111101101000 d7e93f3fb2b53fd7e93f3fb2b5a4e53f8fc7baa6c33fd7e93f3fb2b5a4e53f8fc7baa6c33f68
UTF-8 怏얘랩乙첡怏얘랩乙ゅ듋洹γ렍怏얘랩乙ゅ듋洹γ렍h 1110011010000000100011111110110010010110100110001110101110011110101010011110010010111001100110011110110010110010101000011110011010000000100011111110110010010110100110001110101110011110101010011110010010111001100110011110001110000010100001011110101110010011100010111110011010110100101110011100111010110011111010111010000010001101111001101000000010001111111011001001011010011000111010111001111010101001111001001011100110011001111000111000001010000101111010111001001110001011111001101011010010111001110011101011001111101011101000001000110101101000 e6808fec9698eb9ea9e4b999ecb2a1e6808fec9698eb9ea9e4b999e38285eb938be6b4b9ceb3eba08de6808fec9698eb9ea9e4b999e38285eb938be6b4b9ceb3eba08d68
UHC 怏얘랩乙첡怏얘랩乙ゅ듋洹γ렍怏얘랩乙ゅ듋洹γ렍h 1110010011101000101111101110101010110111101001101110101111100000101010110100101011100100111010001011111011101010101101111010011011101011111000001010101011100101100010101011111011101010101101111010010111100011100011101010001111100100111010001011111011101010101101111010011011101011111000001010101011100101100010101011111011101010101101111010010111100011100011101010001101101000 e4e8beeab7a6ebe0ab4ae4e8beeab7a6ebe0aae58abeeab7a5e38ea3e4e8beeab7a6ebe0aae58abeeab7a5e38ea368

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)