To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????ぜ喩??嵬?????揄λ?悠?? 001111110011111100111111001111110011111110000010101110101001101001100111001111110011111110011011110010100011111100111111001111110011111100111111100111011000100110000011110010010011111110010111010010010011111100111111 3f3f3f3f3f82ba9a673f3f9bca3f3f3f3f3f9d8983c93f97493f3f
EUC-JP ???沅?ぜ喩??嵬?????揄λ?悠?? 0011111100111111001111111000111111000110111010010011111110100100101111001101001111001000001111110011111111010110110011000011111100111111001111110011111100111111110110011110100110100110110010110011111111001101101010100011111100111111 3f3f3f8fc6e93fa4bcd3c83f3fd6cc3f3f3f3f3fd9e9a6cb3fcdaa3f3f
UTF-8 嶺뚮뿭沅좄ぜ喩쏆춷嵬됱뮆藺삣쮦揄λ봾悠뺧쫲 1110111110100110101010111110101110011010101011101110101110111111101011011110011010110010100001011110110010100010100001001110001110000001100111001110010110010110101010011110110010001111100001101110110010110110101101111110010110110101101011001110101110010000101100011110101110101110100001101110111110100111101100001110110010000010101000111110110010101110101001101110011010001111100001001100111010111011111010111011010010111110111001101000001010100000111010111011101010100111111011001010101110110010 efa6abeb9aaeebbfade6b285eca284e3819ce596a9ec8f86ecb6b7e5b5aceb90b1ebae86efa7b0ec82a3ecaea6e68f84cebbebb4bee682a0ebbaa7ecabb2
UHC 嶺뚮뿭沅좄ぜ喩쏆춷嵬됱뮆藺삣쮦揄λ봾悠뺧쫲 111001111010110110001100111010111001011110101101111010101011011010100000111010001010101010111100111010101110011110011011111011001010110110010011111010001110001110001001111011001001001010010101111011001110000110111011111001011010100010000011111010101111000110100101111010111001010010000101111010101110110110010101111011111010011010001010 e7ad8ceb97adeab6a0e8aabceae79becad93e8e389ec9295ece1bbe5a883eaf1a5eb9485eaed95efa68a

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)