To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 筌??油??兢誼??儒??繞???θ? 1110001010100011001111110011111110010110111110110011111100111111100110010101110110001011011000100011111100111111100011101111001000111111001111111110001110000101001111110011111100111111100000111100011000111111 e2a33f3f96fb3f3f995d8b623f3f8ef23f3fe3853f3f3f83c63f
EUC-JP 筌??油??兢誼??儒??繞???θ? 1110010010100101001111110011111111001100111111010011111100111111110100011011111010110101110000110011111100111111101111001111010000111111001111111110010111100101001111110011111100111111101001101100100000111111 e4a53f3fccfd3f3fd1beb5c33f3fbcf43f3fe5e53f3f3fa6c83f
UTF-8 筌뚮뿦油답풚兢誼띰쭓儒멥뀊繞볥쓬劉θ퉪 1110011110101101100011001110101110011010101011101110101110111111101001101110011010110010101110011110101110001011101101011110110110010010100110101110010110000101101000101110100010101010101111001110101110011101101100001110110010101101100100111110010110000100100100101110101110101001101001011110101110000000100010101110011110111001100111101110101110110011101001011110110010010011101011001110111110100111100001111100111010111000111011011000100110101010 e7ad8ceb9aaeebbfa6e6b2b9eb8bb5ed929ae585a2e8aabceb9db0ecad93e58492eba9a5eb808ae7b99eebb3a5ec93acefa787ceb8ed89aa
UHC 筌뚮뿦油답풚兢誼띰쭓儒멥뀊繞볥쓬劉θ퉪 1110111110100111100011001110101110010111101001101110101011111010101101001110010010111110100111011101000011100111111010111111111010110110111011111010011110001011111010101110001110111000111000111000010110000110111010011010010010010011111010111001110110001100111010101110010110100101111010001011100110000010 efa78ceb97a6eafab4e4be9dd0e7ebfeb6efa78beae3b8e38586e9a493eb9d8ceae5a5e8b982

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)