To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 哀??肄?┸儒??繞???ゅ?儒??癲??游 10001000101000110011111100111111111000111110010100111111100001001011110110001110111100100011111100111111111000111000010100111111001111110011111110000010111000110011111110001110111100100011111100111111111000011001111100111111001111111001111111100000 88a33f3fe3e53f84bd8ef23f3fe3853f3f3f82e33f8ef23f3fe19f3f3f9fe0
EUC-JP 哀??肄?┸儒??繞??馹ゅ?儒??癲??游 101100001010010100111111001111111110011011100111001111111010100010111111101111001111010000111111001111111110010111100101001111110011111110001111111010011010000110100100111001010011111110111100111101000011111100111111111000101010000100111111001111111101111011100010 b0a53f3fe6e73fa8bfbcf43f3fe5e53f3f8fe9a1a4e53fbcf43f3fe2a13f3fdee2
UTF-8 哀노끀肄됵┸儒삳쐨繞볥쓣馹ゅ맫儒띠퐠癲용끇游 111001011001001110000000111010111000010110111000111010111000000110000000111010001000001010000100111010111001000010110101111000101001010010111000111001011000010010010010111011001000001010110011111011001001000010101000111001111011100110011110111010111011001110100101111011001001001110100011111010011010011010111001111000111000001010000101111010111010011110101011111001011000010010010010111010111001110110100000111011011001000010100000111001111001100110110010111011001001101010101001111010111000000110000111111001101011100010111000 e59380eb85b8eb8180e88284eb90b5e294b8e58492ec82b3ec90a8e7b99eebb3a5ec93a3e9a6b9e38285eba7abe58492eb9da0ed90a0e799b2ec9aa9eb8187e6b8b8
UHC 哀노끀肄됵┸儒삳쐨繞볥쓣馹ゅ맫儒띠퐠癲용끇游 1110010011101110101100111110101110000101101101101110110010111101100010011110111110100110101111111110101011100011101110111110101110011100100011011110100110100100100100111110101110011101100001001110110011110001101010101110010110010000101100111110101011100011101101101110110010111101100010011110111110100110101111111110101110000101101110111110101011111101 e4eeb3eb85b6ecbd89efa6bfeae3bbeb9c8de9a493eb9d84ecf1aae590b3eae3b6ecbd89efa6bfeb85bbeafd

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)