Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 闇λ?爾e?蹂??[闇λ?爾e?蹂??[^ 10001000110001011000001111001001001111111000111010100010100000101000010100111111111001101111100000111111001111110101101110001000110001011000001111001001001111111000111010100010100000101000010100111111111001101111100000111111001111110101101101011110 88c583c93f8ea282853fe6f83f3f5b88c583c93f8ea282853fe6f83f3f5b5e
EUC-JP 闇λ?爾e?蹂??[闇λ?爾e?蹂??[^ 10110000110001111010011011001011001111111011110010100100101000111110010100111111111011001111101000111111001111110101101110110000110001111010011011001011001111111011110010100100101000111110010100111111111011001111101000111111001111110101101101011110 b0c7a6cb3fbca4a3e53fecfa3f3f5bb0c7a6cb3fbca4a3e53fecfa3f3f5b5e
UTF-8 闇λ퀫爾e깷蹂잛끽[闇λ퀫爾e깷蹂잛끽[^ 11101001100101111000011111001110101110111110110110000000101010111110011110001000101111101110111110111101100001011110101010111001101101111110100010111001100000101110110010011110100110111110101110000001101111010101101111101001100101111000011111001110101110111110110110000000101010111110011110001000101111101110111110111101100001011110101010111001101101111110100010111001100000101110110010011110100110111110101110000001101111010101101101011110 e99787cebbed80abe788beefbd85eab9b7e8b982ec9e9beb81bd5be99787cebbed80abe788beefbd85eab9b7e8b982ec9e9beb81bd5b5e
UHC 闇λ퀫爾e깷蹂잛끽[闇λ퀫爾e깷蹂잛끽[^ 111001001110000110100101111010111011001110011111111011001011001110100011111001011000001110100101111010111011001110011111111011001011001110100011010110111110010011100001101001011110101110110011100111111110110010110011101000111110010110000011101001011110101110110011100111111110110010110011101000110101101101011110 e4e1a5ebb39fecb3a3e583a5ebb39fecb3a35be4e1a5ebb39fecb3a3e583a5ebb39fecb3a35b5e

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
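
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation. It assumes that Python's built-in codecs `cp932` and `cp949` correspond to SJIS-WIN and UHC, and that characters a charset cannot represent are replaced with "?" (0x3f), which is what the ISO-8859-1 row suggests. The names `CHARSETS` and `encode_table` are hypothetical.

```python
# Map the labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters
        # (an assumption about how the tool behaves).
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_table("λ[^"):
        print(label, bits, hex_string)
```

For the input "λ[^", this sketch yields 3f5b5e for ISO-8859-1 and cebb5b5e for UTF-8, which is consistent with the corresponding bytes in the table above.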