To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 烏△?艦? 率?易[烏△?艦? 率?易[^ 100010010100011110000001101000100011111110001010110011010011111110000001010000001001011110100110001111111000100011010101010110111000100101000111100000011010001000111111100010101100110100111111100000010100000010010111101001100011111110001000110101010101101101011110 894781a23f8acd3f814097a63f88d55b894781a23f8acd3f814097a63f88d55b5e
EUC-JP 烏△?艦? 率?易[烏△?艦? 率?易[^ 101100011010100010100010101001000011111110110100110011110011111110100001101000011100111010101000001111111011000011010111010110111011000110101000101000101010010000111111101101001100111100111111101000011010000111001110101010000011111110110000110101110101101101011110 b1a8a2a43fb4cf3fa1a1cea83fb0d75bb1a8a2a43fb4cf3fa1a1cea83fb0d75b5e
UTF-8 烏△뀤艦김 率쏓易[烏△뀤艦김 率쏓易[^ 111001111000001110001111111000101001011010110011111010111000000010100100111010001000100110100110111010101011100110000000111000111000000010000000111001111000111010000111111011001000111110010011111001101001100010010011010110111110011110000011100011111110001010010110101100111110101110000000101001001110100010001001101001101110101010111001100000001110001110000000100000001110011110001110100001111110110010001111100100111110011010011000100100110101101101011110 e7838fe296b3eb80a4e889a6eab980e38080e78e87ec8f93e698935be7838fe296b3eb80a4e889a6eab980e38080e78e87ec8f93e698935b5e
UHC 烏△뀤艦김 率쏓易[烏△뀤艦김 率쏓易[^ 111010001010000110100001111000101000010110011011111110011110011010110001111010001010000110100001111000011110001110011011111110011110011010110110010110111110100010100001101000011110001010000101100110111111100111100110101100011110100010100001101000011110000111100011100110111111100111100110101101100101101101011110 e8a1a1e2859bf9e6b1e8a1a1e1e39bf9e6b65be8a1a1e2859bf9e6b1e8a1a1e1e39bf9e6b65b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)