What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦??裕??癒る?域??悠??膺??永 100010010101000100111111001111111001011101010100001111110011111110010110111111001000001011101001001111111000100011100110001111110011111110010111010010010011111100111111111001000101111000111111001111111000100101101001 89513f3f97543f3f96fc82e93f88e63f3f97493f3fe45e3f3f8969
EUC-JP 渦??裕??癒る?域??悠??膺??永 101100011011001000111111001111111100110110110101001111110011111111001100111111101010010011101011001111111011000011101000001111110011111111001101101010100011111100111111111001111011111100111111001111111011000111001010 b1b23f3fcdb53f3fccfea4eb3fb0e83f3fcdaa3f3fe7bf3f3fb1ca
UTF-8 渦기뫁裕뗦끽癒る쎗域㏐퀣悠밧웻膺덊닄永 111001101011100010100110111010101011100010110000111010111010101110000001111010001010001110010101111010111001011110100110111010111000000110111101111001111001100110010010111000111000001010001011111011001000111010010111111001011001111110011111111000111000111110010000111011011000000010100011111001101000001010100000111010111011000010100111111011001001101110111011111010001000011010111010111010111000110110001010111010111000101110000100111001101011000010111000 e6b8a6eab8b0ebab81e8a395eb97a6eb81bde79992e3828bec8e97e59f9fe38f90ed80a3e682a0ebb0a7ec9bbbe886baeb8d8aeb8b84e6b0b8
UHC 渦기뫁裕뗦끽癒る쎗域㏐퀣悠밧웻膺덊닄永 1110100010111110101100011110001010010001101001011110101110101110100010111110011010110011101000111110101110101000101010101110101110011011101111101110011010110100101001111110101010110011100101111110101011101101101110011110010110011111100001111110101111101100100010001110110110001000100011011110011110110101 e8beb1e291a5ebae8be6b3a3eba8aaeb9bbee6b4a7eab397eaedb9e59f87ebec88ed888de7b5

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
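
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the code behind this page. The mapping from the table's charset names to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, and the helper names `encode_report` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the runs of 3f in the rows above.

```python
# Assumed mapping from the table's charset names to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: closest Python codec to SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: Python's name for Unified Hangul Code
}


def encode_report(text, codec):
    """Return (bit string, hex string) for text encoded with codec.

    Unencodable characters are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_report(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in convert("永").items():
        print(name, bits, hex_string)
```

For the single character 永, this sketch yields e6b0b8 in UTF-8, 8969 in CP932, and b1ca in EUC-JP, which agree with the final bytes of the corresponding rows in the table; in ISO-8859-1 the character cannot be represented and comes out as 3f.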