To what bit string is a character (or a string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 迢ク螻樔サ也矯譌乗クャ迢ク雉願「也矯雉 1110011110001011101110001110010110110001100111101110010010111011100101101110011110001011101110001110011010010111100011111110011010111000101011001110011110001011101110001110100010110011100010101110100010100010100101101110011110001011101110001110100010110011 e78bb8e5b19ee4bb96e78bb8e6978fe6b8ace78bb8e8b38ae8a296e78bb8e8b3
EUC-JP 迢ク螻樔サ也矯譌乗クャ迢ク雉願「也矯雉 1110110111101011100011101011100011101010101100111101110011100110100011101011101111001100111010011011011010111010111010111111011110111110111010001000111010111000100011101010110011101101111010111000111010111000111100001011010110110100111010101000111010100010110011001110100110110110101110101111000010110101 edeb8eb8eab3dce68ebbcce9b6baebf7bee88eb88eacedeb8eb8f0b5b4ea8ea2cce9b6baf0b5
UTF-8 迢ク螻樔サ也矯譌乗クャ迢ク雉願「也矯雉 111010001011111110100010111011111011110110111000111010001001111010111011111001101010100010010100111011111011110110111011111001001011100110011111111001111001111110101111111010001010110110001100111001001011100110010111111011111011110110111000111011111011110110101100111010001011111110100010111011111011110110111000111010011001101110001001111010011010000110011000111011111011110110100010111001001011100110011111111001111001111110101111111010011001101110001001 e8bfa2efbdb8e89ebbe6a894efbdbbe4b99fe79fafe8ad8ce4b997efbdb8efbdace8bfa2efbdb8e99b89e9a198efbda2e4b99fe79fafe99b89
UHC ?????也矯??????雉願?也矯雉 0011111100111111001111110011111100111111111001011010010111001110111011000011111100111111001111110011111100111111001111111111011011001011111010101100001100111111111001011010010111001110111011001111011011001011 3f3f3f3f3fe5a5ceec3f3f3f3f3f3ff6cbeac33fe5a5ceecf6cb

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
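
A conversion like the one in the table above can be approximated with a short Python sketch. This is not the code behind this page. The mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and `encode_report` is a hypothetical helper name. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which appears to be what happened in the ISO-8859-1 and UHC rows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return {charset label: (binary bit string, hexadecimal string)} for text."""
    report = {}
    for label, codec in CODECS.items():
        # errors="replace" substitutes "?" (0x3f) for unencodable characters.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        report[label] = (bits, raw.hex())
    return report


if __name__ == "__main__":
    for label, (bits, hexed) in encode_report("A").items():
        print(label, bits, hexed)
```

For an ASCII character such as "A", every charset listed here yields the same single byte (`41`); the rows only diverge for characters outside ASCII.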