To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 衡①?衡?》衡①?衡?》衡??衡②?B 100011010111010010000111010000000011111110001101011101000011111110000001011101001000110101110100100001110100000000111111100011010111010000111111100000010111010010001101011101000011111100111111100011010111010010000111010000010011111101000010 8d7487403f8d743f81748d7487403f8d743f81748d743f3f8d7487413f42
EUC-JP 衡??衡?》衡??衡?》衡??衡??B 101110011101010100111111001111111011100111010101001111111010000111010101101110011101010100111111001111111011100111010101001111111010000111010101101110011101010100111111001111111011100111010101001111110011111101000010 b9d53f3fb9d53fa1d5b9d53f3fb9d53fa1d5b9d53f3fb9d53f3f42
UTF-8 衡①왃衡⅛》衡①왃衡⅛》衡⅛옠衡②젃B 11101000101000011010000111100010100100011010000011101100100110011000001111101000101000011010000111100010100001011001101111100011100000001000101111101000101000011010000111100010100100011010000011101100100110011000001111101000101000011010000111100010100001011001101111100011100000001000101111101000101000011010000111100010100001011001101111101100100110001010000011101000101000011010000111100010100100011010000111101100101000001000001101000010 e8a1a1e291a0ec9983e8a1a1e2859be3808be8a1a1e291a0ec9983e8a1a1e2859be3808be8a1a1e2859bec98a0e8a1a1e291a1eca08342
UHC 衡①왃衡⅛》衡①왃衡⅛》衡⅛옠衡②젃B 11111011101011001010100011100111100111101011011011111011101011001010100011111011101000011011011111111011101011001010100011100111100111101011011011111011101011001010100011111011101000011011011111111011101011001010100011111011100111101010001011111011101011001010100011101000101000001000011101000010 fbaca8e79eb6fbaca8fba1b7fbaca8e79eb6fbaca8fba1b7fbaca8fb9ea2fbaca8e8a08742

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
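
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not this page's own implementation: the function name `encode_table` is hypothetical, and the mapping of the table's charset labels to Python codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption. Characters that a charset cannot represent are replaced with `?` (hex 3f) via `errors="replace"`, which resembles the `?` entries in the table, although the replacement behavior of the original tool may differ in detail.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become b"?" instead of raising an error.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hexstr in encode_table("衡B"):
        print(label, bits, hexstr)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.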