To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 褥?(??━蹂△?厄μ?韋?褥?(??━ 11100101111100010011111110000001011010010011111100111111100001001010101011100110111110001000000110100010001111111001011011101111100000111100101000111111111010001110100000111111111001011111000100111111100000010110100100111111001111111000010010101010 e5f13f81693f3f84aae6f881a23f96ef83ca3fe8e83fe5f13f81693f3f84aa
EUC-JP 褥?(??━蹂△?厄μ?韋?褥?(??━ 11101010111100110011111110100001110010100011111100111111101010001010110011101100111110101010001010100100001111111100110011110001101001101100110000111111111100001110101000111111111010101111001100111111101000011100101000111111001111111010100010101100 eaf33fa1ca3f3fa8acecfaa2a43fccf1a6cc3ff0ea3feaf33fa1ca3f3fa8ac
UTF-8 褥띕(痢뺧━蹂△뵶厄μ뢾韋펅褥띕(痢뺧━ 1110100010100100101001011110101110011101100101011110111110111100100010001110111110100111101001011110101110111010101001111110001010010100100000011110100010111001100000101110001010010110101100111110101110110101101101101110010110001110100001001100111010111100111010111010001010111110111010011001111110001011111011011000111010000101111010001010010010100101111010111001110110010101111011111011110010001000111011111010011110100101111010111011101010100111111000101001010010000001 e8a4a5eb9d95efbc88efa7a5ebbaa7e29481e8b982e296b3ebb5b6e58e84cebceba2bee99f8bed8e85e8a4a5eb9d95efbc88efa7a5ebbaa7e29481
UHC 褥띕(痢뺧━蹂△뵶厄μ뢾韋펅褥띕(痢뺧━ 11101001101100111011011011101011101000111010100011101100101110001001010111101111101001101010110011101011101100111010000111100010100101001011010011100100111110001010010111101100100011111000000111101010110111111011110001011000111010011011001110110110111010111010001110101000111011001011100010010101111011111010011010101100 e9b3b6eba3a8ecb895efa6acebb3a1e294b4e4f8a5ec8f81eadfbc58e9b3b6eba3a8ecb895efa6ac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)