What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 淙?雨祠持逡??豆?姐磐??枝?轅?^ 1001111111001000001111111000100101001010111000100100101110001110100111011110011110010101001111110011111110010011101001000011111110001000101101111001010011010110001111110011111110001110011111010011111111100111011101100011111101011110 9fc83f894ae24b8e9de7953f3f93a43f88b794d63f3f8e7d3fe7763f5e
EUC-JP 淙?雨祠持逡??豆?姐磐??枝?轅?^ 1101111011001010001111111011000110101011111000111010110010111011111111011110110111110101001111110011111111000110101001100011111110110000101110011100100011011000001111110011111110111011110111100011111111101101110101110011111101011110 deca3fb1abe3acbbfdedf53f3fc6a63fb0b9c8d83f3fbbde3fedd73f5e
UTF-8 淙렊雨祠持逡淚렜豆썼姐磐렰렞枝렋轅렢^ 11100110101101111001100111101011101000001000101011101001100110111010100011100111101001011010000011100110100011001000000111101001100000001010000111101111101001011000110111101011101000001001110011101000101100011000011011101100100011011011110011100101101001111001000011100111101000111001000011101011101000001011000011101011101000001001111011100110100111101001110111101011101000001000101111101000101111011000010111101011101000001010001001011110 e6b799eba08ae99ba8e7a5a0e68c81e980a1efa58deba09ce8b186ec8dbce5a790e7a390eba0b0eba09ee69e9deba08be8bd85eba0a25e
UHC 淙렊雨祠持逡淚렜豆썼姐磐렰렞枝렋轅렢^ 11110000111110001000111010100001111010011110101111011110111001101111001010100101111100011110010011010010111001111000111010101110110101001110011110111101111010001110111010111011110110101111000110001110101111011000111010101111111100101010101110001110101000101110101010111111100011101011001101011110 f0f88ea1e9ebdee6f2a5f1e4d2e78eaed4e7bde8eebbdaf18ebd8eaff2ab8ea2eabf8eb35e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
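
A conversion like the one in the table can be approximated in a few lines of Python. This is only a sketch, not how this page is implemented: the mapping from the table's charset labels to Python codec names (latin-1, cp932, euc_jp, utf-8, cp949) is an assumption, and the helper names encode_row and encode_table are made up for illustration. Characters that a charset cannot represent are replaced with "?" (hex 3f), which is consistent with the 3f bytes seen in the ISO-8859-1 row above.

```python
# Assumed mapping from the table's labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("雨^").items():
        print(label, bits, hex_string)
```

Using errors="replace" rather than the default strict mode means a row is always produced, at the cost of losing the original character wherever the charset has no code for it.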