To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 烏??悠??節???△?爰??柔レ?沃 100010010100011100111111001111111001011101001001001111110011111110010000110111110011111100111111001111111000000110100010001111111110000010100111001111110011111110001111010111111000001110001100001111111001011110000000 89473f3f97493f3f90df3f3f3f81a23fe0a73f3f8f5f838c3f9780
EUC-JP 烏??悠??節??繇△?爰??柔レ?沃 1011000110101000001111110011111111001101101010100011111100111111110000001110000100111111001111111000111111010100110100011010001010100100001111111110000010101001001111110011111110111101110000001010010111101100001111111100110111100000 b1a83f3fcdaa3f3fc0e13f3f8fd4d1a2a43fe0a93f3fbdc0a5ec3fcde0
UTF-8 烏띾맧悠됵쭓節륁춷繇△돧爰곭춯柔レ뵯沃 111001111000001110001111111010111001110110111110111010111010011110100111111001101000001010100000111010111001000010110101111011001010110110010011111001111010111110000000111010111010010110000001111011001011011010110111111001111011100110000111111000101001011010110011111010111000111110100111111001111000100010110000111010101011001110101101111011001011011010101111111001101001111110010100111000111000001110101100111010111011010110101111111001101011001010000011 e7838feb9dbeeba7a7e682a0eb90b5ecad93e7af80eba581ecb6b7e7b987e296b3eb8fa7e788b0eab3adecb6afe69f94e383acebb5afe6b283
UHC 烏띾맧悠됵쭓節륁춷繇△돧爰곭춯柔レ뵯沃 1110100010100001100011011110101110010000101100001110101011101101100010011110111110100111100010111110111110111101100011111110110010101101100100111110100110100011101000011110001010001001101010111110101010111010100000011110011110101101100011001110101011110101101010111110110010010100101011011110100010101010 e8a18deb90b0eaed89efa78befbd8fecad93e9a3a1e289abeaba81e7ad8ceaf5abec94ade8aa

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
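
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The mapping from the table's charset names to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) is an assumption, and so is the choice to replace unencodable characters with `?` (0x3f), which is made only because it matches the `3f` bytes in the table. The helper names `to_bit_strings` and `convert` are hypothetical.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for `text` encoded with `codec`."""
    # Assumption: characters the charset cannot represent become '?' (0x3f).
    data = text.encode(codec, errors="replace")
    # Each byte is written as 8 binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Encode `text` in every charset and collect the resulting bit strings."""
    return {name: to_bit_strings(text, codec) for name, codec in CODECS.items()}


if __name__ == "__main__":
    for name, (binary, hexadecimal) in convert("烏").items():
        print(name, binary, hexadecimal)
```

For the single character 烏, this sketch yields `e7838f` in UTF-8, `8947` in SJIS-WIN, and `b1a8` in EUC-JP, which agree with the leading bytes of the corresponding table rows, and `3f` in ISO-8859-1 because that charset has no such character.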