To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲l?竊???有??酉??畑????? 11100001100111111000001010001100001111111110001010000110001111110011111100111111100101110100110000111111001111111001001111010001001111110011111110010100101010000011111100111111001111110011111100111111 e19f828c3fe2863f3f3f974c3f3f93d13f3f94a83f3f3f3f3f
EUC-JP 癲l?竊??琯有??酉??畑??佾?? 1110001010100001101000111110110000111111111000111110011000111111001111111000111111001100101100111100110110101101001111110011111111000110110100110011111100111111110010001010101000111111001111111000111110110000111110110011111100111111 e2a1a3ec3fe3e63f3f8fccb3cdad3f3fc6d33f3fc8aa3f3f8fb0fb3f3f
UTF-8 癲l옓竊덅첀琯有뷂쭔酉⑷괠畑띕뀪佾뚩굜 111001111001100110110010111011111011110110001100111011001001100010010011111001111010101110001010111010111000110110000101111011001011001010000000111001111001000010101111111001101001110010001001111010111011011110000010111011001010110110010100111010011000010110001001111000101001000110110111111010101011010010100000111001111001010110010001111010111001110110010101111010111000000010101010111001001011110110111110111010111001101010101001111010101011010110011100 e799b2efbd8cec9893e7ab8aeb8d85ecb280e790afe69c89ebb782ecad94e98589e291b7eab4a0e79591eb9d95eb80aae4bdbeeb9aa9eab59c
UHC 癲l옓竊덅첀琯有뷂쭔酉⑷괠畑띕뀪佾뚩굜 1110111110100110101000111110110010011110100110011110111110111100100010001110100010101010100011011100111010110101111010101111001110010100111011111010011110001100111010111011011110101001111010101011000110100111111011111010010110110110111010111000010110100000111011001110101110001100111010001000001010000100 efa6a3ec9e99efbc88e8aa8dceb5eaf394efa78cebb7a9eab1a7efa5b6eb85a0eceb8ce88284

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)