To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????\n}?????????\n{^ 00111111001111110011111100111111001111110011111100111111001111110011111101011100011011100111110100111111001111110011111100111111001111110011111100111111001111110011111101011100011011100111101101011110 3f3f3f3f3f3f3f3f3f5c6e7d3f3f3f3f3f3f3f3f3f5c6e7b5e
SJIS-WIN 鼇??鎰??酉??\n}鼇??鎰??酉??\n{^ 11101010100001110011111100111111111010000100110000111111001111111001001111010001001111110011111101011100011011100111110111101010100001110011111100111111111010000100110000111111001111111001001111010001001111110011111101011100011011100111101101011110 ea873f3fe84c3f3f93d13f3f5c6e7dea873f3fe84c3f3f93d13f3f5c6e7b5e
EUC-JP 鼇??鎰??酉??\n}鼇??鎰??酉??\n{^ 11110011111001110011111100111111111011111010110100111111001111111100011011010011001111110011111101011100011011100111110111110011111001110011111100111111111011111010110100111111001111111100011011010011001111110011111101011100011011100111101101011110 f3e73f3fefad3f3fc6d33f3f5c6e7df3e73f3fefad3f3fc6d33f3f5c6e7b5e
UTF-8 鼇앸뵃鎰묉샍酉담뀅\n}鼇앸뵃鎰묉샍酉담뀅\n{^ 11101001101111001000011111101100100101011011100011101011101101011000001111101001100011101011000011101011101011001000100111101100100000111000110111101001100001011000100111101011100010111011010011101011100000001000010101011100011011100111110111101001101111001000011111101100100101011011100011101011101101011000001111101001100011101011000011101011101011001000100111101100100000111000110111101001100001011000100111101011100010111011010011101011100000001000010101011100011011100111101101011110 e9bc87ec95b8ebb583e98eb0ebac89ec838de98589eb8bb4eb80855c6e7de9bc87ec95b8ebb583e98eb0ebac89ec838de98589eb8bb4eb80855c6e7b5e
UHC 鼇앸뵃鎰묉샍酉담뀅\n}鼇앸뵃鎰묉샍酉담뀅\n{^ 11101000101010001001110111101011100101001000100111101100111100001001000111100110100110001011101111101011101101111011010011100011100001011000000101011100011011100111110111101000101010001001110111101011100101001000100111101100111100001001000111100110100110001011101111101011101101111011010011100011100001011000000101011100011011100111101101011110 e8a89deb9489ecf091e698bbebb7b4e385815c6e7de8a89deb9489ecf091e698bbebb7b4e385815c6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)